PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

A61K 39/395, 39/42, C12Q 1/00, 1/70, 
G01N 33/53 



(11) International Publication Number: WO 00/69462 

(43) International Publication Date: 23 November 2000 (23.11.00)



(21) International Application Number: PCT/US00/13694

(22) International Filing Date: 18 May 2000 (18.05.00)



18 May 1999 (18.05.99) 



(71) Applicant (for all designated States except US): SMITHKLINE
BEECHAM CORPORATION [US/US]; One Franklin Plaza,
Philadelphia, PA 19103 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): GROSS, Mitchell, S. 
[US/US]; 667 Pugh Road, Wayne, PA 19087 (US). SWEET, 
Raymond, W. [US/US]; 108 Edgehill Road, Bala Cynwyd, 
PA 19004 (US). TAYLOR, Geraldine [GB/GB]; Compton, 
Newbury, Berkshire RG20 7NN (GB). 

(74) Agents: BAUMEISTER, Kirk et al.; Smithkline Beecham 
Corporation, Corporate Intellectual Property, UW2220, 709 
Swedeland Road, P.O. Box 1539, King of Prussia, PA 
19406-0939 (US). 



(81) Designated States: AE, AL, AU, BA, BB, BG, BR, CA, CN, 
CZ, DZ, EE, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, 
KP, KR, LC, LK, LR, LT, LV, MA, MG, MK, MN, MX, 
NO, NZ, PL, RO, SG, SI, SK, SL, TR, TT, TZ, UA, US, 
UZ, VN, YU, ZA, ARIPO patent (GH, GM, KE, LS, MW, 
MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ, 
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, 
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, 
GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Title: HUMAN MONOCLONAL ANTIBODY 
(57) Abstract 

This invention relates to novel human monoclonal antibodies (mAbs) and to the genes encoding same. More specifically, this invention 
relates to human monoclonal antibodies specifically reactive with an epitope of the fusion (F) protein of Respiratory Syncytial Virus (RSV). 
Such antibodies are useful for the therapeutic and/or prophylactic treatment of RSV infection in human patients, particularly infants and 
young children. 






HUMAN MONOCLONAL ANTIBODY 



Field of the Invention 

This invention relates to novel human monoclonal
antibodies (mAbs) and to the genes encoding same. More
specifically, this invention relates to human monoclonal
antibodies specifically reactive with an epitope of the
fusion (F) protein of Respiratory Syncytial Virus (RSV).
Such antibodies are useful for the therapeutic and/or
prophylactic treatment of RSV infection in human
patients, particularly infants and young children.

Background of the Invention 

Respiratory syncytial virus (RSV) is the major
cause of lower respiratory disease in children, giving
rise to predictable annual epidemics of bronchiolitis
and pneumonia in children worldwide. The virus is
highly contagious, and infections can occur at any age.
Comprehensive details concerning RSV infection and its
clinical features can be obtained from excellent recent
reviews by McIntosh, K. and R. M. Chanock, In:
"Respiratory Syncytial Virus", Ch. 38, B.N. Fields ed.,
Raven Press (1990) and Hall, C.B., In: "Textbook of
Pediatric Disease" Feigin and Cherry, eds., W.B.
Saunders, pgs 1247-1268 (1987).

RSV is distributed worldwide. One of the most
remarkable features of the epidemiology of RSV virus, as
mentioned above, is the consistent pattern of infection
and disease. Other respiratory viruses cause epidemics
at irregular intervals or exhibit a mixed
endemic/epidemic pattern, but RSV is the only
respiratory viral pathogen that produces a sizable
epidemic every year in large urban centers. In the
temperate areas of the world, RSV epidemics have
occurred primarily in the late fall, winter or spring
but never during the summer. The occurrence and spread
of infection within a community is characteristic and
easily diagnosed, leading to sharp rises in cases of
bronchiolitis and pediatric pneumonia and the number of
hospital admissions of young children with acute lower
respiratory tract disease. Other respiratory viral
agents that occur in outbreaks are rarely present at the
same time as RSV.

Primary RSV infection occurs in the very young.
Zero to 2 year old infants are the most susceptible and
represent the primary affected population. In this
group, 1 out of 5 will develop lower respiratory (below
larynx) disease upon infection and this ratio stays the
same upon reinfection. By 1 year of age, 25-50% of
infants have specific antibodies as a result of natural
infection and this is close to 100% by age 4-5. Thus,
virtually all children have been infected before they
have entered school.

Age, sex, socioeconomic and environmental factors
can all influence the severity of disease.
Hospitalization is required in 1-3% of cases of RSV
infection and is usually of long duration (up to 3
weeks). The high morbidity of RSV infection, especially
in infancy, has also been implicated in the development
of respiratory problems later in life. With current
intensive care in the U.S. and the other developed
countries, overall mortality for normal subjects is low
(less than 2% of hospitalized subjects). However,
mortality is much higher in less developed countries
and, even in developed countries, mortality is high in
certain risk groups such as in infants with underlying
cardiac condition (cyanotic congenital heart disease) or
respiratory disease (bronchopulmonary dysplasia) where
the progression of symptoms may be rapid. For instance,
mortality in infants with cyanotic congenital heart
disease has been reported to be as high as 37%. In
premature infants apneic spells due to RSV infection may
occur and, in rare cases, cause neurologic or systemic
damage. Severe lower respiratory tract illness
(bronchiolitis and pneumonia) is most common in patients
under six months of age. Infants who have apparently
recovered completely from this illness may display
symptomatic respiratory abnormalities for years
(recurrent wheezing, decreased pulmonary function,
recurrent cough, asthma, and bronchitis).

Immunity to RSV appears to be short-lived, thus
reinfections are frequent. The mechanisms by which the
immune system protects against RSV infection and
reinfection are not well understood. It is clear,
however, that immunity is only partially protective
since reinfection is common at all ages, and sometimes
occurs in infants only weeks after recovery from a
primary infection. Both serum and secretory antibodies
(IgA) have been detected in response to RSV infection in
adults as well as in very young infants. However, the
titers of serum antibodies to the viral F or G
glycoprotein, as well as of neutralizing antibodies
found in infants (1-8 months of age) are 15-25% of those
found in older subjects. These reduced titers may
contribute to the increased incidence of serious
infection in younger children.

Evidence for the role of serum antibodies in
protection against RSV virus has emerged from
epidemiological as well as animal studies. In adults
exposed naturally to the virus, susceptibility
correlated well with low serum antibody level. In
infants, titers of maternally transmitted antibodies
correlate with resistance to serious disease [Glezen,
W.P. et al., J. Pediatr. 98:708-715 (1981)]. Other
studies show that the incidence and severity of lower
respiratory tract involvement is diminished in the
presence of high serum antibody [McIntosh, K. et al., J.
Infect. Dis. 138:24-32 (1978)] and high titers of
passively administered serum neutralizing antibodies
have been shown to be protective in a cotton rat model
of RSV infection [Prince, G. A. et al., Virus Res.
3:193-206 (1985)].

Children lacking cell-mediated immunity are unable
to overcome their infection and shed virus for many
months in contrast to children with normal immune
systems. Similarly, nude mice infected with RSV virus
persistently shed virus. These mice can be cured by
adoptive transfer of primed T cells [Cannon, M. J. et
al., Immunology 62:133-138 (1987)].

In summary, it appears that both cellular and
humoral immunity are involved in protection against
infection, reinfection and RSV disease and that although
antigenic variation is limited, protective immunity is
not complete even after multiple exposures.

RSV, belonging to the family Paramyxoviridae, is a
negative-strand unsegmented RNA virus with properties
similar to those of the paramyxoviruses. It has,
however, been placed in a separate genus, Pneumovirus,
based on morphologic differences and lack of
hemagglutinin and neuraminidase activities. RSV is
pleomorphic and ranges in size from 150-300 nm in
diameter. The virus matures by budding from the outer
membrane of a cell and virions appear as membrane-bound
particles with short, closely spaced projections or
"spikes". The RNA genome encodes 10 unique viral
polypeptides ranging in size from 9.5 kDa to 160 kDa
[Huang, Y. T. and G. W. Wertz, J. Virol. 43:150-157
(1982)]. Seven proteins (F, G, N, P, L, M, M2) are
present in RSV virions and at least three proteins (F,
G, and SH) are expressed on the surface of infected
cells. The F protein [SEQ ID NO: 20] has been
conclusively identified as the protein responsible for
cell fusion since specific antibodies to this protein
inhibit syncytia formation in vitro and cells infected
with vaccinia virus expressing recombinant F protein
form syncytia in the absence of other RSV virus
proteins. In contrast, antibodies to the G protein do
not block syncytia formation but prevent attachment of
the virus to cells.

RSV can be divided into two antigenically distinct
subgroups (A & B) [Mufson, M. A. et al., J. Gen'l.
Virol. 66:2111-2124 (1985)]. This antigenic dimorphism
is linked primarily to the surface attachment (G)
glycoprotein [Johnson, R. A. et al., Proc. Nat'l. Acad.
Sci. USA 84:5625-5629 (1987)]. Strains of both group A
and B circulate simultaneously, but the proportion of
each may vary unpredictably from year to year. An
effective therapy must therefore target both subgroups
of the virus and this is the reason for the selection of
the highly conserved surface fusion (F) protein as
target antigen for mAb therapy as will be discussed
later.

The induction of neutralizing antibodies to RSV
virus appears to be limited to the F and G surface
glycoproteins. Of these two proteins, the F protein is
the major target for cross-reactive neutralizing
antibodies associated with protection against different
strains of RSV virus. In addition, experimental
vaccination of mice or cotton rats with F protein also
results in cross protection. The antigenic relatedness
of the F protein across strains and subgroups of the
virus is reflected in its high degree of homology at the
amino acid level. In contrast, in the two subgroups and
various strains of RSV, antigenic dimorphism was linked
primarily to the G glycoprotein. The F protein has a
predicted molecular weight of 68-70 kDa; a signal
peptide at its N-terminus; a membrane anchor domain at
its C-terminus; and is cleaved proteolytically in the
infected cell prior to virion assembly to yield
disulfide linked F2 and F1. Five neutralizing epitopes
have been identified within the F protein sequence [SEQ
ID NO: 20] and map to residues 205-225; 259-278;
289-299; 483-488 and 417-438. Studies to determine the
frequency of sequence divergence in the F protein [SEQ ID
NO: 20] showed that the majority of the neutralizing
epitopes were conserved in all of the 23 strains of RSV
virus isolated in Australia, Europe, and regions of the
U.S. over a period of thirty years. In another study,
seroresponses of forty-three infants and young children
to primary infection with a subgroup A or a subgroup B
strain showed that responses to homologous and
heterologous F antigens were not significantly
different, while the G proteins of the subgroup A and B
strains were quite unrelated. Moreover, antibody
inhibition of virus-mediated cell fusion in vitro versus
inhibition of infection correlates best with protection
in animal models and fusion inhibition is primarily
restricted to F protein specific antibodies.
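
By way of illustration only, the five epitope residue ranges recited above can be held in a small lookup and used to ask whether a given F protein position falls within a neutralizing epitope, or which epitopes are unchanged between two aligned sequences. The following is a minimal sketch; the function names are hypothetical, the numbering is assumed to be 1-based and inclusive, and the sequences in the usage note are placeholders rather than SEQ ID NO: 20.

```python
# Residue ranges of the five neutralizing epitopes as recited in the text
# (assumed 1-based, inclusive numbering along the F protein sequence).
NEUTRALIZING_EPITOPES = [(205, 225), (259, 278), (289, 299), (417, 438), (483, 488)]


def epitope_for_residue(position):
    """Return the (start, end) epitope range containing position, or None."""
    for start, end in NEUTRALIZING_EPITOPES:
        if start <= position <= end:
            return (start, end)
    return None


def conserved_epitopes(reference, variant):
    """Return the epitope ranges whose residues are identical in both sequences.

    Both sequences are assumed to be pre-aligned and of equal length.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (start, end)
        for start, end in NEUTRALIZING_EPITOPES
        if reference[start - 1:end] == variant[start - 1:end]
    ]
```

For example, with a placeholder reference of 500 identical residues and a variant differing only at position 210, conserved_epitopes would report every range except 205-225.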

Prophylactic treatment for RSV infection is thus
desirable for the high risk groups of children as well
as for all children in underdeveloped countries.
However, a vaccine for RSV infection is not currently
available. Severe safety issues surrounding an
attenuated whole virus vaccine tested in the 1960s, as
well as the potential of induced immunopathology
associated with the newer candidate subunit vaccines
make the prospects of a vaccine in the near future
appear remote. To date one drug therapy, Ribavirin, a
broad spectrum antiviral, has been approved. Ribavirin
has gained only minimal acceptance owing to problems of
administration, mild toxicity and questionable efficacy.
In the majority of cases, hospitalized children receive
no drug therapy and receive only intensive supportive
care which is extremely costly. It is clear that there
is a need for a safe, effective and easily administered
drug for the treatment of RSV infection.

The use of passive antibody therapy in humans is
well documented and is being used to treat other
infectious diseases such as hepatitis and
cytomegalovirus. The feasibility of passive antibody
treatment/protection against RSV has been well
established using animal models. Most of the earlier
passive transfer studies in animals against infectious
agents, including RSV, utilized murine mAbs. Studies in
animals have clearly demonstrated that polyclonal and
monoclonal antibody against both F and G glycoprotein
can confer passive protection in RSV virus infection
when given prophylactically or therapeutically [Prince,
et al., supra]. In these studies, passive transfer of
neutralizing F or G mAbs to mice, cotton rats or
monkeys significantly reduces or completely prevents
replication of the RSV virus in the lungs. However, as
discussed above, clearly, the F protein is the more
important target for antibody therapy.

Recently, the FDA has approved for use intravenous
gammaglobulins (IVIG) isolated from pooled human sera.
Initial reports from this study had been encouraging
[Groothuis, J. R. et al., Antimicrob. Agents Chemo.
35(7):1469-1473 (1991)]. However, generic shortcomings
of IVIGs exist and include, without limitation, the fact
that such products are human blood derived and grams of
antibody often need to be administered to achieve an
effective dose.

Alternatively, monoclonal antibodies have been
employed. The advantages of such an approach include: a
higher concentration of specific antibody can be
achieved thereby reducing the amount of globulin
required to be given; the reliance on direct blood
products can be eliminated; the levels of antibody in
the preparation can be more uniformly controlled and the
routes of administration can be extended. While passive
immunotherapy employing monoclonal antibodies from a
heterologous species (e.g., murine) has been suggested
(See: PCT Application PCT/US94/08699, Publication No. WO
95/04081), one alternative to reduce the risk of an
undesirable immune response on the part of the patient
directed against the foreign antibody is to employ
"humanized" antibodies. These antibodies are
substantially of human origin, with only the
Complementarity Determining Regions (CDRs) being of
non-human origin. Particularly useful examples of this
approach are disclosed in PCT Application
PCT/GB91/01554, Publication No. WO 92/04381 and PCT
Application PCT/GB93/00725, Publication No. WO93/20210.
Clinical trials are on-going to evaluate the efficacy of
humanized antibodies for treatment of RSV infection in
young children.

A second and more preferred approach is to employ
fully human mAbs. Unfortunately, there have been few
successes in producing human monoclonal antibodies
through classic hybridoma technology. Indeed,
acceptable human fusion partners have not been
identified and murine myeloma fusion partners do not
work well with human cells, yielding unstable and low
producing hybridoma lines. However, recent advances in
molecular biology and immunology make it now possible to
isolate human mAbs, particularly directed against
foreign infectious agents.

Fully human mAbs to RSV F protein [SEQ ID NO: 20]
remain a desirable option for the treatment of this
disease. Although some success has been reported in
obtaining fragments of such mAbs [Barbas, C.F. et al.,
Proc. Nat'l. Acad. Sci. USA 89:10164-10168 (1992);
Crowe, J. E. et al., Proc. Nat'l. Acad. Sci. USA
91:1386-1390 (1994) and PCT application number
PCT/US93/08786, published as WO94/06448, March 31,
1994], the achievement of such results is not
straightforward. Novel human mAbs, when and however
obtained, are particularly useful alone or in
combination with existing molecules to form
immunotherapeutic compositions.

There exists a need in the art for useful 
prophylactic compositions for the prevention or passive 
treatment of RSV. 



Brief Description of the Invention

In one aspect, this invention provides fully human
monoclonal antibodies and functional fragments thereof
specifically reactive with an F protein epitope of RSV
and capable of neutralizing RSV infection. These human
mAbs specific for the F protein of RSV virus may be
useful to passively treat or prevent infection.

In another aspect, the present invention provides
modifications to neutralizing single chain Fv fragments
(scFv) specific for the F protein of RSV produced by
random combinatorial cloning of human antibody sequences
and isolated from a filamentous phage Fab display
library.

In still another aspect, there is provided a
reshaped or altered human antibody containing human
heavy and light chain constant regions from a first
human donor and heavy and light chain variable regions
or the CDRs thereof derived from human neutralizing
monoclonal antibodies for the F protein of RSV derived
from a second human donor.

In yet another aspect, the present invention 
provides a pharmaceutical composition which contains one 
(or more) altered or reshaped antibodies and a 
pharmaceutically acceptable carrier. 

In yet another aspect, the invention provides a
pharmaceutical composition comprising at least one dose
of an immunotherapeutically effective amount of the
reshaped, altered or monoclonal antibody of this
invention in combination with at least one additional
monoclonal, altered or reshaped antibody. A particular
embodiment is provided in which the additional antibody
is an anti-RSV antibody distinguished from the subject
antibody of the invention by virtue of being reactive
with a different epitope of the RSV F protein antigen
than the subject antibody of the invention.

In a further aspect, the present invention provides
a method for passive immunotherapy of RSV disease in a
human by administering to said human an effective amount
of the pharmaceutical composition of the invention for
the prophylactic or therapeutic treatment of RSV
infection.

In yet another aspect, the present invention
provides methods for, and components useful in, the
recombinant production of human and altered antibodies
(e.g., engineered antibodies, CDRs, Fab or F(ab')2
fragments, or analogs thereof) which are derived from
human neutralizing monoclonal antibodies (mAbs) for the
F protein of RSV. These components include isolated
nucleic acid sequences encoding same and recombinant
plasmids containing the nucleic acid sequences under the
control of selected regulatory sequences which are
capable of directing the expression thereof in host
cells (preferably mammalian) transfected with the
recombinant plasmids. The production method involves
culturing a transfected host cell line of the present
invention under conditions such that the human or
altered antibody is expressed in said cells and
isolating the expressed product therefrom.

Still another aspect of the invention is a
method to diagnose the presence of RSV in a human which
comprises contacting a sample of biological fluid with
the human antibodies and altered antibodies and
fragments thereof of the instant invention and assaying
for the occurrence of binding between said human
antibody (or altered antibody, or fragment) and RSV.

Other aspects and advantages of the present 
invention are described further in the detailed 
description and the preferred embodiments thereof. 


Brief Description of the Drawings 

Fig. 1A is a graph illustrating the competition of
Gλ-1 scFv phage binding with RSV19 mAb [International
patent publication No. WO92/04381, published March 19,
1992].

Fig. 1B is a graph illustrating the competition of
Gλ-1 scFv phage binding with RSV B4 mAb [International
patent publication No. WO93/20210, published October 14,
1993].

Fig. 2 is a graph illustrating virus neutralization
by scFv phages Gλ-1, Gλ-3, and Gκ-1 with RSV strain 273.

Fig. 3 illustrates the DNA sequence [SEQ ID NO: 1]
and protein sequence (amino acids reported in single
letter code) [SEQ ID NO: 2] for the Gλ-1 light chain
variable region, processed N-terminus through framework
IV.

Fig. 4 illustrates the DNA sequence [SEQ ID NO: 3]
and protein sequence (amino acids reported in single
letter code) [SEQ ID NO: 4] for the Gλ-1 heavy chain
variable region, processed N-terminus through framework
IV.

Fig. 5 illustrates the cloning strategy used for
the construction of the Gλ-1 monoclonal antibody. The
heavy chain V region was cloned into the pCD derivative
vector as a XhoI - ApaI fragment. The entire light
chain V region was cloned into the pCN derivative
vector, 43-1pcn, as a SacI - AvrII fragment. Details
are described below.

Fig. 6 provides a comparison of the heavy chain
amino acid sequences of the Gλ-1 single chain Fv [SEQ ID
NO: 5] and various monoclonal antibodies of this
invention. The amino acid sequences of the heavy chains
for the A [SEQ ID NO: 7] and B [SEQ ID NO: 8] constructs
are shown. Numbering of the residues is based on the
germline (GL) gene Dp58 [SEQ ID NO: 6], beginning at the
mature processed amino terminus and ending at CDR3. The
"-" indicates identity to the preceding sequence (e.g., A
compared to B). Bold residues correspond to the leader
region, and to CDRs 1-3.
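
The "-" identity convention used in the sequence comparison figures can be illustrated with a short sketch. It assumes two pre-aligned sequences of equal length; the function name and the example residues are hypothetical and are not taken from the sequence listing.

```python
def mask_identical(preceding, sequence, identity_char="-"):
    """Render sequence with identity_char wherever it matches preceding.

    Both inputs are assumed to be pre-aligned and of equal length, so that
    position i in one corresponds to position i in the other.
    """
    if len(preceding) != len(sequence):
        raise ValueError("sequences must be aligned to the same length")
    return "".join(
        identity_char if a == b else b for a, b in zip(preceding, sequence)
    )


# Hypothetical 7-residue fragments differing at the third position.
print(mask_identical("QVQLVES", "QVKLVES"))  # prints --K----
```

Only the residues that differ from the preceding line are spelled out, which is what lets a reader pick out substitutions between constructs at a glance.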

Fig. 7 provides a comparison of the light chain
amino acid sequences of the Gλ-1A single chain Fv [SEQ ID
NO: 9] and various monoclonal antibodies of this
invention. The amino acid sequences of the light chains
for the A [SEQ ID NO: 11] and B [SEQ ID NO: 12]
constructs are shown. Numbering of the residues in the
Vλ region is based on the germline (GL) gene DpL8 [SEQ
ID NO: 10], beginning at the mature processed amino
terminus and ending at CDR3. For reference to framework
4, the actual numbering is also shown for Gλ-1A. As in
Fig. 6, the "-" indicates identity to the preceding
sequence.

Figs. 8A to 8F illustrate the continuous DNA
sequence [SEQ ID NO: 13] of the expression plasmid
Gλ-1Apcd containing the RSV neutralizing human Gλ-1 mAb for
the heavy chain. The start of translation, leader
peptide, amino-terminal processing site, carboxy
terminus of the Gλ-1 heavy chain, and Eco RI restriction
endonuclease cleavage site are shown.

Figs. 9A to 9E illustrate the continuous DNA
sequence [SEQ ID NO: 14] of the expression plasmid
Gλ-1Apcn containing the RSV neutralizing human Gλ-1 mAb for
the light chain. The corresponding features for the
light chain as for Figs. 8A-8F are shown.

Figs. 10A and 10B illustrate the continuous DNA
sequence [SEQ ID NO: 15] of the coding region of the
heavy chain of plasmid Gλ-1Bpcd. Bolded residues
indicate differences from the full vector sequence for
Gλ-1Apcd in Figs. 8A-8F [SEQ ID NO: 13].

Fig. 11 is the DNA sequence [SEQ ID NO: 16] of the
coding region for the light chain of plasmid Gλ-1Bpcn.
Bolded residues indicate differences from the full
vector sequence for Gλ-1Apcn in Figs. 9A-9E [SEQ ID NO:
14].

Detailed Description of the Invention

This invention provides useful human monoclonal
antibodies (and fragments thereof) reactive with the F
protein of RSV, isolated nucleic acids encoding same and
various means for their recombinant production as well
as therapeutic, prophylactic and diagnostic uses of such
antibodies and fragments thereof.

I. Definitions. 

As used in this specification and the claims, the
following terms are defined as follows:

"Altered antibody" refers to a protein encoded by
an altered immunoglobulin coding region, which may be
obtained by expression in a selected host cell. Such
altered antibodies are engineered antibodies (e.g.,
chimeric, humanized, or reshaped or immunologically
edited human antibodies) or fragments thereof lacking
all or part of an immunoglobulin constant region, e.g.,
Fv, Fab, or F(ab')2 and the like.

"Altered immunoglobulin coding region" refers to a
nucleic acid sequence encoding an altered antibody of
the invention or a fragment thereof.

"Reshaped human antibody" refers to an altered
antibody in which minimally at least one CDR from a
first human monoclonal donor antibody is substituted for
a CDR in a second human acceptor antibody. Preferably
all six CDRs are replaced. More preferably an entire
antigen combining region (e.g., Fv, Fab or F(ab')2) from
a first human donor monoclonal antibody is substituted
for the corresponding region in a second human acceptor
monoclonal antibody. Most preferably the Fab region
from a first human donor is operatively linked to the
appropriate constant regions of a second human acceptor
antibody to form a full length monoclonal antibody.

"First immunoglobulin partner" refers to a nucleic
acid sequence encoding a human framework or human
immunoglobulin variable region in which the native (or
naturally-occurring) CDR-encoding regions are replaced
by the CDR-encoding regions of a donor human antibody.
The human variable region can be an immunoglobulin heavy
chain, a light chain (or both chains), an analog or
functional fragments thereof. Such CDR regions, located
within the variable region of antibodies
(immunoglobulins) can be determined by known methods in
the art. For example, Kabat et al. (Sequences of
Proteins of Immunological Interest, 4th Ed., U.S.
Department of Health and Human Services, National
Institutes of Health (1987)) disclose rules for locating
CDRs. In addition, computer programs are known which
are useful for identifying CDR regions/structures.

"Second fusion partner" refers to another
nucleotide sequence encoding a protein or peptide to
which the first immunoglobulin partner is fused in frame
or by means of an optional conventional linker sequence
(i.e., operatively linked). Preferably the fusion
partner is an immunoglobulin gene and when so, it is
referred to as a "second immunoglobulin partner". The
second immunoglobulin partner may include a nucleic acid
sequence encoding the entire constant region for the
same (i.e., homologous - the first and second altered
antibodies are derived from the same source) or an
additional (i.e., heterologous) antibody of interest.
It may be an immunoglobulin heavy chain or light chain
(or both chains as part of a single polypeptide). The
second immunoglobulin partner is not limited to a
particular immunoglobulin class or isotype. In
addition, the second immunoglobulin partner may comprise
part of an immunoglobulin constant region, such as found
in a Fab, or F(ab')2 (i.e., a discrete part of an
appropriate human constant region or framework region).

A second fusion partner may also comprise a sequence
encoding an integral membrane protein exposed on the
outer surface of a host cell, e.g., as part of a phage
display library, or a sequence encoding a protein for
analytical or diagnostic detection, e.g., horseradish
peroxidase (HRP), β-galactosidase, etc.
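
The Kabat rules cited above for locating CDRs can be illustrated with a minimal sketch. It assumes the variable region has already been numbered according to Kabat, ignores insertion letters (e.g., 35A, 52A), and uses the commonly cited Kabat CDR boundaries; those boundaries and the function name are assumptions made for illustration and are not recited in this specification.

```python
# Commonly cited Kabat CDR boundaries (inclusive Kabat position numbers).
# Assumption for illustration only; insertion letters are ignored.
KABAT_CDRS = {
    "light": {"CDR1": (24, 34), "CDR2": (50, 56), "CDR3": (89, 97)},
    "heavy": {"CDR1": (31, 35), "CDR2": (50, 65), "CDR3": (95, 102)},
}


def kabat_region(chain, position):
    """Classify a Kabat-numbered position as CDR1, CDR2, CDR3 or framework.

    chain is "light" or "heavy"; position is the integer Kabat number.
    """
    for name, (start, end) in KABAT_CDRS[chain].items():
        if start <= position <= end:
            return name
    return "framework"
```

For example, kabat_region("heavy", 52) falls within heavy chain CDR2, while kabat_region("light", 40) is a framework position. Assigning Kabat numbers to a raw sequence in the first place is the harder step and is not attempted here.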

The terms Fv, Fc, Fd, Fab, or F(ab')2 are used with
their standard meanings [see, e.g., Harlow et al.,
Antibodies A Laboratory Manual, Cold Spring Harbor
Laboratory, (1988)].

As used herein, an "engineered antibody" describes
a type of altered antibody, i.e., a full-length
synthetic antibody (e.g., a chimeric, humanized,
reshaped or immunologically edited human antibody as
opposed to an antibody fragment) in which a portion of
the light and/or heavy chain variable domains of a
selected acceptor antibody are replaced by analogous
parts from one or more donor antibodies which have
specificity for the selected epitope. For example, such
molecules may include antibodies characterized by a
humanized heavy chain associated with an unmodified
light chain (or chimeric light chain), or vice versa.
Engineered antibodies may also be characterized by
alteration of the nucleic acid sequences encoding the
acceptor antibody light and/or heavy variable domain
framework regions in order to retain donor antibody
binding specificity. These antibodies can comprise
replacement of one or more CDRs (preferably all) from
the acceptor antibody with CDRs from a donor antibody
described herein.

A "chimeric antibody" refers to a type of
engineered antibody which contains naturally-occurring
variable region (light chain and heavy chains) derived
from a donor antibody in association with light and
heavy chain constant regions derived from an acceptor
antibody from a heterologous species.

A "humanized antibody" refers to a type of
engineered antibody having its CDRs derived from a
non-human donor immunoglobulin, the remaining
immunoglobulin-derived parts of the molecule being
derived from one (or more) human immunoglobulin(s). In
addition, framework support residues may be altered to
preserve binding affinity [see, e.g., Queen et al.,
Proc. Nat'l. Acad. Sci. USA, 86:10029-10032 (1989),
Hodgson et al., Bio/Technology, 9:421 (1991)].

An "immunologically edited antibody" refers to a
type of engineered antibody in which changes are made in
donor and/or acceptor sequences to edit regions in
respect of cloning artifacts, germ line enhancements,
etc., aimed at reducing the likelihood of an
immunological response to the antibody on the part of a
patient being treated with the edited antibody.

The term "donor antibody" refers to an antibody
(monoclonal or recombinant) which contributes the
nucleic acid sequences of its variable regions, CDRs, or
other functional fragments or analogs thereof to a first
immunoglobulin partner, so as to provide the altered
immunoglobulin coding region and resulting expressed
altered antibody with the antigenic specificity and
neutralizing activity characteristic of the donor
antibody. One donor antibody suitable for use in this
invention is a Fab fragment of a human neutralizing
monoclonal antibody designated as Fab GX-1. Fab GX-1 is
defined as having the variable light and heavy chain
DNA and amino acid sequences shown in Figs. 3,
4, 8A-8F and 9A-9E [SEQ ID NOS: 1-4, 13 and 14].

The term "acceptor antibody" refers to an antibody
(monoclonal or recombinant) from a source genetically
unrelated to the donor antibody, which contributes all
(or any portion, but preferably all) of the nucleic acid
sequences encoding its heavy and/or light chain
framework regions and/or its heavy and/or light chain
constant regions to the first immunoglobulin partner.
Preferably a human antibody is the acceptor antibody.

"CDRs" are defined as the complementarity
determining region amino acid sequences of an antibody
which are the hypervariable regions of immunoglobulin
heavy and light chains [see, e.g., Kabat et al.,
Sequences of Proteins of Immunological Interest, 4th
Ed., U.S. Department of Health and Human Services,
National Institutes of Health (1987)]. There are three
heavy chain and three light chain CDRs (or CDR regions)
in the variable portion of an immunoglobulin. Thus,
"CDRs" as used herein refers to all three heavy chain
CDRs, or all three light chain CDRs (or both all heavy
and all light chain CDRs, if appropriate). CDRs provide
the majority of contact residues for the binding of the
antibody to the antigen or epitope. CDRs of interest in
this invention are derived from donor antibody variable
heavy and light chain sequences, and include analogs of
the naturally occurring CDRs, which analogs also share
or retain the same antigen binding specificity and/or
neutralizing ability as the donor antibody from which
they were derived.

By "sharing the antigen binding specificity or
neutralizing ability" is meant, for example, that
although Fab GX-1 may be characterized by a certain
level of antigen affinity, a CDR encoded by a nucleic
acid sequence of Fab GX-1 in an appropriate structural
environment may have a lower or higher affinity. It is
expected that CDRs of Fab GX-1 in such environments will
nevertheless recognize the same epitope(s) as does the
intact Fab GX-1. A "functional fragment" is a partial
heavy or light chain variable sequence (e.g., minor
deletions at the amino or carboxy terminus of the
immunoglobulin variable region) which retains the same
antigen binding specificity and/or neutralizing ability
as the antibody from which the fragment was derived.

An "analog" is an amino acid sequence modified by
at least one amino acid, wherein said modification can
be a chemical modification, or a substitution or a
rearrangement of a few amino acids (i.e., no more than
10), which modification permits the amino acid sequence
to retain the biological characteristics, e.g., antigen
specificity and high affinity, of the unmodified
sequence. For example, (silent) mutations can be
constructed, via substitutions, when certain
endonuclease restriction sites are created within or
surrounding CDR-encoding regions.

Analogs may also arise as allelic variations. An
"allelic variation or modification" is an alteration in
the nucleic acid sequence encoding the amino acid or
peptide sequences of the invention. Such variations or
modifications may be due to degeneracy in the genetic
code or may be deliberately engineered to provide
desired characteristics. These variations or
modifications may or may not result in alterations in
any encoded amino acid sequence.
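
The distinction between nucleotide changes that alter the
encoded amino acid sequence and those that do not can be
illustrated with a short sketch. It is offered only as an
illustration and is not part of the disclosed methods: the
function names and the six-nucleotide example are
hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_silent_variant(original: str, variant: str) -> bool:
    """True if the two coding sequences differ but encode the same peptide."""
    differs = original.upper() != variant.upper()
    return differs and translate(original) == translate(variant)


def count_substitutions(peptide_a: str, peptide_b: str) -> int:
    """Number of positions at which two equal-length peptides differ."""
    if len(peptide_a) != len(peptide_b):
        raise ValueError("peptides must have equal length")
    return sum(a != b for a, b in zip(peptide_a, peptide_b))


# Hypothetical example: GGT TCT and GGA TCC both encode Gly-Ser, but only
# the second contains the BamHI recognition site GGATCC.
original = "GGTTCT"
variant = "GGATCC"
print(translate(original), translate(variant))
print(is_silent_variant(original, variant))
```

A variant for which `is_silent_variant` is true corresponds to
a modification that does not alter the encoded sequence, while
`count_substitutions` gives a simple way to check the "no more
than 10" limit stated for analogs when only substitutions are
involved.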

The term "effector agents" refers to non-protein
carrier molecules to which the altered antibodies,
and/or natural or synthetic light or heavy chains of the
donor antibody or other fragments of the donor antibody
may be associated by conventional means. Such
non-protein carriers can include conventional carriers used
in the diagnostic field, e.g., polystyrene or other
plastic beads, polysaccharides, e.g., as used in the
BIAcore (Pharmacia) system, or other non-protein
substances useful in the medical field and safe for
administration to humans and animals. Other effector
agents may include a macrocycle, for chelating a heavy
metal atom, or radioisotopes. Such effector agents may
also be useful to increase the half-life of the altered
antibodies, e.g., polyethylene glycol.

II. Combinatorial Cloning.

As mentioned above, a number of problems have
hampered the direct application of the hybridoma
technology [G. Kohler and C. Milstein, Nature, 256:495-497
(1975)] to the generation and isolation of human
monoclonal antibodies. Among these are a lack of
suitable fusion partner myeloma cell lines used to form
hybridoma cell lines as well as the poor stability of
such hybridomas even when formed. These shortcomings
are further exacerbated in the case of RSV because of
the paucity of viral specific B cells in the peripheral
circulation. Therefore, the molecular biological
approach of combinatorial cloning is preferred.

Combinatorial cloning is disclosed generally in PCT
Publication No. WO90/14430. Simply stated, the goal of
combinatorial cloning is to transfer to a population of
bacterial cells the immunological genetic capacity of a
human cell, tissue or organ. It is preferred to employ
cells, tissues or organs which are immunocompetent.
Particularly useful sources include, without limitation,
spleen, thymus, lymph nodes, bone marrow, tonsil and
peripheral blood lymphocytes. The cells may be
optionally RSV stimulated in vitro, or selected from
donors which are known to have produced an immune
response or donors who are HIV+ but asymptomatic.

The genetic information isolated from the donor
cells can be in the form of DNA or RNA and is
conveniently amplified by Polymerase Chain Reaction
(PCR) or similar techniques. When isolated as RNA the
genetic information is preferably converted into cDNA by
reverse transcription prior to amplification. The
amplification can be generalized or more specifically
tailored. For example, by a careful selection of PCR
primer sequences, selective amplification of
immunoglobulin genes or subsets within that class of
genes can be achieved.

Once the component gene sequences are obtained, in
this case the genes encoding the variable regions of the
various heavy and light antibody chains, the light and
heavy chain genes are associated in random combinations
to form a random combinatorial library. Various
recombinant DNA vector systems have been described to
facilitate combinatorial cloning [see: PCT Publication
No. WO90/14430, supra; Scott and Smith, Science 249:386-406
(1990); or U.S. Patent 5,223,409]. Having
generated the combinatorial library, the products can,
after expression, be conveniently screened by biopanning
with RSV F protein or, if necessary, by epitope blocked
biopanning as described in more detail below.
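
The random association of heavy and light chain genes can be
pictured with a small sketch. It is an illustration only: the
gene labels are hypothetical placeholders, and the code models
the pairing step alone, not the vector systems or screening
described in the cited references.

```python
import random
from itertools import product


def build_combinatorial_library(heavy_genes, light_genes):
    """Return every possible heavy/light pairing as (heavy, light) tuples."""
    return list(product(heavy_genes, light_genes))


def sample_random_clones(heavy_genes, light_genes, n_clones, seed=0):
    """Draw n_clones random pairings, as a library of limited size would contain."""
    rng = random.Random(seed)
    return [
        (rng.choice(heavy_genes), rng.choice(light_genes))
        for _ in range(n_clones)
    ]


# Hypothetical labels standing in for amplified variable region genes.
heavy = ["VH1", "VH2", "VH3"]
light = ["VL1", "VL2"]

library = build_combinatorial_library(heavy, light)
print(len(library))  # number of distinct pairings: len(heavy) * len(light)
print(sample_random_clones(heavy, light, n_clones=4))
```

The number of distinct pairings grows as the product of the
numbers of heavy and light chain genes, which is why a
screening step such as biopanning is needed to recover the
few pairings that bind the antigen.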

As described herein, it is preferred to use single
chain antibodies for combinatorial cloning and screening
and then to convert them to full length mAbs after
selection of the desired candidate molecules. However,
Fab fragments of mAbs can also be used for cloning and
screening.

III. Antibody Fragments. 

The present invention contemplates the use of scFv,
Fab, or F(ab')2 fragments to derive full-length mAbs
directed against the F protein of RSV. Although these
fragments may be independently useful as protective and
therapeutic agents in vivo against RSV-mediated
conditions or in vitro as part of an RSV diagnostic,
they are employed herein as a component of a reshaped
human antibody. A scFv fragment contains the light and
heavy chain variable regions joined by a linker of about
12 amino acids in either a light-linker-heavy or a
heavy-linker-light orientation. A Fab fragment contains
the entire light chain and amino terminal portion of the
heavy chain; and a F(ab')2 fragment is the fragment
formed by two Fab fragments bound by additional
disulfide bonds. RSV binding monoclonal antibodies
provide sources of scFv or Fab fragments which can be
obtained from a combinatorial phage library [see, e.g.,
Winter et al., Ann. Rev. Immunol., 12:433-455 (1994) or
Barbas et al., Proc. Nat'l. Acad. Sci. (USA) 89:10164-10168
(1992), which are both hereby incorporated by
reference in their entireties].
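
The two scFv orientations can be sketched as simple
concatenations. This is an illustration only: the variable
region strings are short hypothetical placeholders, and the
12-residue glycine/serine linker is an assumed example, since
the text specifies only a linker of about 12 amino acids.

```python
# Hypothetical placeholders; real variable regions are far longer.
EXAMPLE_VL = "DIQMT"
EXAMPLE_VH = "EVQLV"
EXAMPLE_LINKER = "GGGGSGGGGSGG"  # assumed 12-residue linker, not from the text


def build_scfv(vl: str, vh: str, linker: str,
               orientation: str = "light-linker-heavy") -> str:
    """Join light and heavy variable regions through a linker."""
    if orientation == "light-linker-heavy":
        return vl + linker + vh
    if orientation == "heavy-linker-light":
        return vh + linker + vl
    raise ValueError(f"unknown orientation: {orientation!r}")


print(build_scfv(EXAMPLE_VL, EXAMPLE_VH, EXAMPLE_LINKER))
print(build_scfv(EXAMPLE_VL, EXAMPLE_VH, EXAMPLE_LINKER,
                 orientation="heavy-linker-light"))
```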

IV. Anti-RSV Antibody Amino Acid and Nucleotide
Sequences of Interest.

The Fab GX-1 or other antibodies described herein
may contribute sequences, such as variable heavy and/or
light chain peptide sequences, framework sequences, CDR
sequences, functional fragments, and analogs thereof,
and the nucleic acid sequences encoding them, useful in
designing and obtaining various altered antibodies which
are characterized by the antigen binding specificity of
the donor antibody.

As one example, the present invention thus provides
variable light chain and variable heavy chain sequences
from the RSV human Fab GX-1A and sequences derived
therefrom. The heavy chain variable region of Fab GX-1A
is illustrated by Figs. 4, 8A-8F and 10A-10B [SEQ ID
NOS: 3-4, 13 and 15].

The nucleic acid sequences of this invention, or
fragments thereof, encoding the variable light chain and
heavy chain peptide sequences are also useful for
mutagenic introduction of specific changes within the
nucleic acid sequences encoding the CDRs or framework
regions, and for incorporation of the resulting modified
or fusion nucleic acid sequence into a plasmid for
expression. For example, silent substitutions in the
nucleotide sequence of the framework and CDR-encoding
regions can be used to create restriction enzyme sites
which would facilitate insertion of mutagenized CDR
(and/or framework) regions. These CDR-encoding regions
may be used in the construction of reshaped human
antibodies of this invention.

Taking into account the degeneracy of the genetic
code, various coding sequences may be constructed which
encode the variable heavy and light chain amino acid
sequences, and CDR sequences of the invention as well as
functional fragments and analogs thereof which share the
antigen specificity of the donor antibody. The isolated
nucleic acid sequences of this invention, or fragments
thereof, encoding the variable chain peptide sequences
or CDRs can be used to produce altered antibodies, e.g.,
chimeric or humanized antibodies, or other engineered
antibodies of this invention when operatively combined
with a second immunoglobulin partner.
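
The number of coding sequences that degeneracy permits for a
given peptide can be estimated with a short sketch. It is an
illustration only, assuming the standard genetic code; the
function names and the two-residue example are hypothetical
and are not taken from the sequences of the invention.

```python
from itertools import product
from math import prod

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each residue (one-letter code) to the list of codons that encode it.
CODONS_BY_RESIDUE = {}
for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS):
    CODONS_BY_RESIDUE.setdefault(aa, []).append("".join(codon))


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences encoding the peptide."""
    return prod(len(CODONS_BY_RESIDUE[aa]) for aa in peptide.upper())


def enumerate_coding_sequences(peptide: str) -> list:
    """List every DNA sequence encoding the peptide (short peptides only)."""
    choices = [CODONS_BY_RESIDUE[aa] for aa in peptide.upper()]
    return ["".join(codons) for codons in product(*choices)]


# Hypothetical dipeptide Gly-Ser: 4 glycine codons x 6 serine codons.
print(count_coding_sequences("GS"))
print(enumerate_coding_sequences("GS")[:3])
```

Because the count is a product over residues, it grows very
quickly with peptide length, so enumeration is practical only
for short stretches such as a site being engineered.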

It should be noted that in addition to isolated
nucleic acid sequences encoding portions of the altered
antibody and antibodies described herein, other such
nucleic acid sequences are encompassed by the present
invention, such as those complementary to the native
CDR-encoding sequences or complementary to the human
framework regions surrounding the CDR-encoding regions.
Such sequences include all nucleic acid sequences which
by virtue of the redundancy of the genetic code are
capable of encoding the same amino acid sequence as
given in Figs. 3 and 4 [SEQ ID NOS: 2 and 4]. Figs. 6
and 7 [SEQ ID NOS: 5-12] provide representations of such
sequences. Other useful DNA sequences encompassed by
this invention include those sequences which hybridize
under stringent hybridization conditions [see: T.
Maniatis et al., Molecular Cloning (A Laboratory
Manual), Cold Spring Harbor Laboratory (1982), pages 387
to 389] to the DNA sequences encoding the GX-1
antibodies (e.g., sequences of Figs. 3, 4, 8A-8F through
11 [SEQ ID NOS: 1-4, 13-16]) and which retain the
antigen binding properties of those antibodies. An
example of one such stringent hybridization condition is
hybridization at 4X SSC at 65°C, followed by a washing in
0.1X SSC at 65°C for an hour. Alternatively, an exemplary
stringent hybridization condition is in 50% formamide,
4X SSC at 42°C. Preferably, these hybridizing DNA
sequences are at least about 18 nucleotides in length,
i.e., about the size of a CDR.
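
The arithmetic behind these conditions can be made explicit
with a short sketch. It is an illustration only and assumes
the standard laboratory definition of 1X SSC (150 mM sodium
chloride, 15 mM sodium citrate), which the text itself does
not state; the function names are hypothetical.

```python
# Assumed standard composition of 1X SSC, in millimolar.
SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_concentrations(fold: float) -> dict:
    """Component concentrations (mM) of SSC at the given fold strength."""
    return {name: fold * conc for name, conc in SSC_1X_MM.items()}


def nucleotides_to_residues(n_nucleotides: int) -> int:
    """Number of complete codons (amino acids) in a coding stretch."""
    return n_nucleotides // 3


print(ssc_concentrations(4))     # hybridization buffer strength
print(ssc_concentrations(0.1))   # wash buffer strength
print(nucleotides_to_residues(18))
```

The lower salt concentration of the wash is what makes it the
more stringent step, and an 18-nucleotide stretch corresponds
to six codons, consistent with the text's description of it as
about the size of a CDR.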

V. Altered Immunoglobulin Coding Regions and
Altered Antibodies.

Altered immunoglobulin coding regions encode
altered antibodies which include engineered antibodies
such as chimeric antibodies, humanized, reshaped, and
immunologically edited human antibodies. A desired
altered immunoglobulin coding region contains
CDR-encoding regions in the form of scFv regions that encode
peptides having the antigen specificity of an RSV
antibody, preferably a high affinity antibody such as
provided by the present invention, inserted into an
acceptor immunoglobulin partner.

When the acceptor is an immunoglobulin partner, as
defined above, it includes a sequence encoding a second
antibody region of interest, for example, an Fc region.
Immunoglobulin partners may also include sequences
encoding another immunoglobulin to which the light or
heavy chain constant region is fused in frame or by
means of a linker sequence. Engineered antibodies
directed against functional fragments or analogs of RSV
may be designed to elicit enhanced binding with the same
antibody.

The immunoglobulin partner may also be associated
with effector agents as defined above, including
non-protein carrier molecules, to which the immunoglobulin
partner may be operatively linked by conventional means.

Fusion or linkage between the immunoglobulin
partners, e.g., antibody sequences, and the effector
agent may be by any suitable means, e.g., by
conventional covalent or ionic bonds, protein fusions,
or hetero-bifunctional cross-linkers, e.g.,
carbodiimide, glutaraldehyde, and the like. Such
techniques are known in the art and readily described in
conventional chemistry and biochemistry texts.

Additionally, conventional linker sequences which
simply provide for a desired amount of space between the
second immunoglobulin partner and the effector agent may
also be constructed into the altered immunoglobulin
coding region. The design of such linkers is well known
to those of skill in the art.

In addition, signal sequences for the molecules of
the invention may be modified to enhance expression.
For example, the reshaped human antibody having the
signal sequence and CDRs derived from the Fab GX-1 heavy
chain sequence may have the original signal peptide
replaced with another signal sequence such as the
Campath leader sequence [Page, M. J. et al.,
BioTechnology 9:64-68 (1991)].

An exemplary altered antibody, a reshaped human
antibody, contains a variable heavy and the entire light
chain peptide or protein sequence having the antigen
specificity of Fab GX-1, fused to the constant heavy
regions CH1-CH3 derived from a second human antibody.

In still a further embodiment, the engineered
antibody of the invention may have attached to it an
additional agent. For example, the procedure of
recombinant DNA technology may be used to produce an
engineered antibody of the invention in which the Fc
fragment or CH2-CH3 domain of a complete antibody
molecule has been replaced by an enzyme or other
detectable molecule (i.e., a polypeptide effector or
reporter molecule).

Another desirable protein of this invention may
comprise a complete antibody molecule, having full
length heavy and light chains, or any discrete fragment
thereof, such as the Fab or F(ab')2 fragments, a heavy
chain dimer, or any minimal recombinant fragments
thereof such as an Fv or a single-chain antibody (SCA) or
any other molecule with the same specificity as the
selected donor Fab GX-1. Such protein may be used in
the form of an altered antibody, or may be used in its
unfused form.

Whenever the immunoglobulin partner is derived from
an antibody different from the donor antibody, e.g., any
isotype or class of immunoglobulin framework or constant
regions, an engineered antibody results. Engineered
antibodies can comprise immunoglobulin (Ig) constant
regions and variable framework regions from one source,
e.g., the acceptor antibody, and one or more (preferably
all) CDRs from the donor antibody, e.g., the anti-RSV
antibody described herein. In addition, alterations,
e.g., deletions, substitutions, or additions, of the
acceptor mAb light and/or heavy variable domain
framework region at the nucleic acid or amino acid
levels, or the donor CDR regions may be made in order to
retain donor antibody antigen binding specificity or to
reduce potential immunogenicity.

Such engineered antibodies are designed to employ
one (or both) of the variable heavy and/or light chains
of the RSV mAb (optionally modified as described) or one
or more of the below-identified heavy or light chain
CDRs. The engineered antibodies of the invention are
neutralizing, i.e., they desirably inhibit virus growth
in vitro and in vivo in animal models of RSV infection.

Such engineered antibodies may include a reshaped
human antibody containing the human heavy and light
chain constant regions fused to the RSV antibody
functional fragments. A suitable human (or other
animal) acceptor antibody may be one selected from a
conventional database, e.g., the KABAT® database, Los
Alamos database, and Swiss Protein database, by homology
to the nucleotide and amino acid sequences of the donor
antibody. A human antibody characterized by a homology
to the framework regions of the donor antibody (on an
amino acid basis) may be suitable to provide a heavy
chain constant region and/or a heavy chain variable
framework region for insertion of the donor CDRs. A
suitable acceptor antibody capable of donating light
chain constant or variable framework regions may be
selected in a similar manner. It should be noted that
the acceptor antibody heavy and light chains are not
required to originate from the same acceptor antibody.
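
Selecting an acceptor by homology on an amino acid basis can
be sketched as a ranking by percent identity. This is an
illustration only: the sequences and acceptor names are
hypothetical, the sequences are assumed to be already aligned
and of equal length, and simple identity is an assumed
stand-in for whatever homology measure a given database
search would use.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which two sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def rank_acceptors(donor_framework: str, candidates: dict) -> list:
    """Return candidate names ordered from most to least similar to the donor."""
    return sorted(
        candidates,
        key=lambda name: percent_identity(donor_framework, candidates[name]),
        reverse=True,
    )


# Hypothetical framework fragments, pre-aligned and of equal length.
donor_heavy_framework = "EVQLVESGGG"
heavy_candidates = {
    "acceptor_A": "QVQLVQSGAE",
    "acceptor_B": "EVQLLESGGG",
}

print(rank_acceptors(donor_heavy_framework, heavy_candidates))
```

Because heavy and light chain acceptors need not come from
the same antibody, the same ranking can be run independently
on a separate set of light chain candidates.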

Desirably the heterologous framework and constant
regions are selected from human immunoglobulin classes
and isotypes, such as IgG (subtypes 1 through 4), IgM,
IgA and IgE. The Fc domains are not limited to native
sequences, but include mutant variants known in the art
that alter function. For example, mutations have been
described in the Fc domains of certain IgG antibodies
that reduce Fc-mediated complement and Fc receptor
binding [see, e.g., A. R. Duncan et al., Nature,
332:563-564 (1988); A. R. Duncan and G. Winter, Nature,
332:738-740 (1988); M.-L. Alegre et al., J. Immunol.,
148:3461-3468 (1992); M.-H. Tao et al., J. Exp. Med.,
178:661-667 (1993); and Y. Xu et al., J. Biol. Chem.,
269:3469-3474 (1994)]; alter clearance rate [J.-K. Kim
et al., Eur. J. Immunol., 24:542-548 (1994)]; and reduce
structural heterogeneity [S. Angal et al., Mol. Immunol.,
30:105-108 (1993)]. Also, other modifications are
possible such as oligomerization of the antibody by
addition of the tailpiece segment of IgM and other
mutations [R. I. F. Smith and S. L. Morrison,
Biotechnology 12:683-688 (1994); R. I. F. Smith et al.,
J. Immunol., 154:2226-2236 (1995)] or addition of the
tailpiece segment of IgA [I. Kariv et al., J. Immunol.,
157:29-38 (1996)]. However, the acceptor antibody need
not comprise only human immunoglobulin protein
sequences. For instance, a gene may be constructed in
which a DNA sequence encoding part of a human
immunoglobulin chain is fused to a DNA sequence encoding
a non-immunoglobulin amino acid sequence such as a
polypeptide effector or reporter molecule.

The altered antibody thus preferably has the
structure of a natural human antibody or a fragment
thereof, and possesses the combination of properties
required for effective therapeutic use, e.g., treatment
of RSV mediated diseases in man, or for diagnostic uses.

It will be understood by those skilled in the art
that an altered antibody may be further modified by
changes in variable domain amino acids without
necessarily affecting the specificity and high affinity
of the donor antibody (i.e., an analog). It is
anticipated that heavy and light chain amino acids may
be substituted by other amino acids either in the
variable domain frameworks or CDRs or both.
Particularly preferred is the immunological editing of
such reconstructed sequences as illustrated in the
examples herein.

In addition, the variable or constant region may be
altered to enhance or decrease selective properties of
the molecules of the instant invention, as described
above. Examples of such properties include dimerization,
binding to Fc receptors, and the ability to bind and
activate complement [see, e.g., Angal et al., Mol. Immunol.,
30:105-108 (1993); Xu et al., J. Biol. Chem., 269:3469-3474
(1994); and Winter et al., EP 307,434-B].

Such antibodies are useful in the prevention and
treatment of RSV mediated disorders, as discussed below.

VI. Production of Altered Antibodies and
Engineered Antibodies.

The resulting reshaped human antibodies of this
invention can be expressed in recombinant host cells,
e.g., COS, CHO or myeloma cells. A conventional
expression vector or recombinant plasmid is produced by
placing these coding sequences for the altered antibody
in operative association with conventional regulatory
control sequences capable of controlling the replication
and expression in, and/or secretion from, a host cell.
Regulatory sequences include promoter sequences, e.g.,
CMV promoter, and signal sequences, which can be derived
from other known antibodies. Similarly, a second
expression vector can be produced having a DNA sequence
which encodes a complementary antibody light or heavy
chain. Preferably this second expression vector is
identical to the first except insofar as the coding
sequences and selectable markers are concerned. This
ensures as far as possible that each polypeptide chain
is functionally expressed. Alternatively, the heavy and
light chain coding sequences for the altered antibody
may reside on a single vector.

A selected host cell is co-transfected by
conventional techniques with both the first and second
vectors (or simply transfected by a single vector) to
create the transfected host cell of the invention
comprising both the recombinant or synthetic light and
heavy chains. The transfected cell is then cultured by
conventional techniques to produce the engineered
antibody of the invention. The production of the
antibody which includes the association of both the
recombinant heavy chain and light chain is measured in
the culture by an appropriate assay, such as an
enzyme-linked immunosorbent assay (ELISA) or radioimmunoassay
(RIA). Similar conventional techniques may be employed
to construct other altered antibodies and molecules of
this invention.

Suitable vectors for the cloning and subcloning
steps employed in the methods and construction of the
compositions of this invention may be selected by one of
skill in the art. For example, the conventional pUC
series of cloning vectors may be used. One vector used
is pUC19, which is commercially available from supply
houses, such as Amersham (Buckinghamshire, United
Kingdom) or Pharmacia (Uppsala, Sweden). Any vector
which is capable of replicating readily, has an
abundance of cloning sites and selectable genes (e.g.,
antibiotic resistance), and is easily manipulated, may
be used for cloning. Thus, the selection of the cloning
vector is not a limiting factor in this invention.

Similarly, the vectors employed for expression of
the engineered antibodies according to this invention
may be selected by one of skill in the art from any
conventional vectors. Preferred vectors include, for
example, plasmids pCD or pCN. The vectors also contain
selected regulatory sequences (such as CMV promoters)
which direct the replication and expression of
heterologous DNA sequences in selected host cells.
These vectors contain the above described DNA sequences
which code for the engineered antibody or altered
immunoglobulin coding region. In addition, the vectors
may incorporate the selected immunoglobulin sequences
modified by the insertion of desirable restriction sites
for ready manipulation.

The expression vectors may also be characterized by
genes suitable for amplifying expression of the
heterologous DNA sequences, e.g., the mammalian
dihydrofolate reductase gene (DHFR). Other preferable
vector sequences include a polyadenylation (polyA)
signal sequence, such as from bovine growth hormone
(BGH) and the betaglobin promoter sequence (betaglopro).
The expression vectors useful herein may be synthesized
by techniques well known to those skilled in this art.
The components of such vectors, e.g., replicons,
selection genes, enhancers, promoters, signal sequences
and the like, may be obtained from commercial or natural
sources or synthesized by known procedures for use in
directing the expression and/or secretion of the product
of the recombinant DNA in a selected host. Other
appropriate expression vectors, of which numerous types
are known in the art for mammalian, bacterial, insect,
yeast, and fungal expression, may also be selected for
this purpose.

The present invention also encompasses a cell line
transfected with a recombinant plasmid containing the
coding sequences of the engineered antibodies or altered
immunoglobulin molecules thereof. Host cells useful for
the cloning and other manipulations of these cloning
vectors are also conventional. However, most desirably,
cells from various strains of E. coli are used for
replication of the cloning vectors and other steps in
the construction of altered antibodies of this
invention.

Suitable host cells or cell lines for the
expression of the engineered antibody or altered
antibody of the invention are preferably mammalian cells
such as CHO, COS, a fibroblast cell (e.g., 3T3), and
myeloid cells, and more preferably a CHO or a myeloid
cell. Human cells may be used, thus enabling the
molecule to be modified with human glycosylation
patterns. Alternatively, other eukaryotic cell lines
may be employed. The selection of suitable mammalian
host cells and methods for transformation, culture,
amplification, screening and product production and
purification are known in the art. See, e.g., Sambrook
et al., Molecular Cloning (A Laboratory Manual), 2nd
edit., Cold Spring Harbor Laboratory (1989).

Bacterial cells may prove useful as host cells
suitable for the expression of the recombinant scFvs,
Fabs and mAbs of the present invention [see, e.g.,
Pluckthun, A., Immunol. Rev., 130:151-188 (1992)]. The
tendency of proteins expressed in bacterial cells to be
in an unfolded or improperly folded form or in a
non-glycosylated form does not pose as great a concern
because Fabs are not normally glycosylated and can be
engineered for exported expression, thereby reducing the
high concentration that facilitates misfolding.
Nevertheless, any recombinant Fab produced in a
bacterial cell would be screened for retention of
antigen binding ability. If the molecule expressed by
the bacterial cell was produced and exported in a
properly folded form, that bacterial cell would be a
desirable host. For example, various strains of E. coli
used for expression are well-known as host cells in the
field of biotechnology. Various strains of B. subtilis,
Streptomyces, other bacilli and the like may also be
employed in this method.

Where desired, strains of yeast cells known to
those skilled in the art are also available as host
cells, as well as insect cells, e.g., Drosophila and
Lepidoptera, and viral expression systems [see, e.g.,
Miller et al., Genetic Engineering, 8:277-298, Plenum
Press (1986) and references cited therein].

The general methods by which the vectors of the
invention may be constructed, the transfection methods
required to produce the host cells of the invention, and
culture methods necessary to produce the altered
antibody of the invention from such host cell are all
conventional techniques. Likewise, once produced, the
altered antibodies of the invention may be purified from
the cell culture contents according to standard
procedures of the art, including ammonium sulfate
precipitation, affinity columns, column chromatography,
gel electrophoresis and the like. Such techniques are
within the skill of the art and do not limit this
invention.

Yet another method of expression of reshaped
antibodies may utilize expression in a transgenic
animal. An exemplary system is described in U.S.
Patent No. 4,873,316. The expression system described
in that reference uses the animal's casein promoter and,
when transgenically incorporated into a mammal, permits
the female to produce the desired recombinant protein in
its milk.

Once expressed by the desired method, the
engineered antibody is then examined for in vitro
activity by use of an appropriate assay. At present,
conventional ELISA assay formats are employed to assess
qualitative and quantitative binding of the altered
antibody to RSV. Additionally, other in vitro assays
and in vivo animal models may also be used to verify
neutralizing efficacy prior to subsequent human clinical
studies performed to evaluate the persistence of the
altered antibody in the body despite the usual clearance
mechanisms.

VII. Therapeutic/Prophylactic Uses.

This invention also relates to a method of treating
humans experiencing RSV-related symptoms which comprises
administering an effective dose of antibodies including
one or more of the antibodies (altered, reshaped,
monoclonal, etc.) described herein or fragments thereof.

The therapeutic response induced by the use of the
molecules of this invention is produced by binding to
RSV and thus subsequently blocking RSV propagation.
Thus, the molecules of the present invention, when in
preparations and formulations appropriate for
therapeutic use, are highly desirable for those persons
experiencing RSV infection. For example, longer
treatments may be desirable when treating seasonal
episodes or the like. The dose and duration of
treatment relates to the relative duration of the
molecules of the present invention in the human
circulation, and can be adjusted by one of skill in the
art depending upon the condition being treated and the
general health of the patient.

The altered antibodies, antibodies and fragments
thereof of this invention may also be used alone or in
conjunction with other antibodies, particularly human or
humanized mAbs reactive with other epitopes on the F
protein or other RSV target antigens as prophylactic
agents.

The mode of administration of the therapeutic and
prophylactic agents of the invention may be any suitable
route which delivers the agent to the host. The altered
antibodies, antibodies, engineered antibodies, and
fragments thereof, and pharmaceutical compositions of
the invention are particularly useful for parenteral
administration, i.e., subcutaneously, intramuscularly,
intravenously, or intranasally.

Therapeutic and prophylactic agents of the
invention may be prepared as pharmaceutical compositions
containing an effective amount of the altered antibody
of the invention as an active ingredient in a
pharmaceutically acceptable carrier. An aqueous
suspension or solution containing the antibody,
preferably buffered at physiological pH, in a form ready
for injection is preferred. The compositions for
parenteral administration will commonly comprise a
solution of the engineered antibody of the invention or
a cocktail thereof dissolved in a pharmaceutically
acceptable carrier, preferably an aqueous carrier. A
variety of aqueous carriers may be employed, e.g., 0.4%
saline, 0.3% glycine, and the like. These solutions are
sterile and generally free of particulate matter. These
solutions may be sterilized by conventional, well known
sterilization techniques (e.g., filtration). The
compositions may contain pharmaceutically acceptable
auxiliary substances as required to approximate
physiological conditions such as pH adjusting and
buffering agents, etc. The concentration of the
antibody of the invention in such pharmaceutical
formulation can vary widely, i.e., from less than about
0.5%, usually at or at least about 1% to as much as 15
or 20% by weight and will be selected primarily based on
fluid volumes, viscosities, etc., according to the
particular mode of administration selected.

Thus, a pharmaceutical composition of the invention
for intramuscular injection could be prepared to contain
1 ml sterile buffered water, and between about 1 ng and
about 100 mg, e.g., about 50 ng to about 80 mg, or more
preferably, about 5 mg to about 75 mg, of an engineered
antibody of the invention. Similarly, a pharmaceutical
composition of the invention for intravenous infusion
could be made up to contain about 250 ml of sterile
Ringer's solution, and about 1 to about 75 and
preferably 5 to about 50 mg/ml of an engineered antibody
of the invention. Actual methods for preparing
parenterally administrable compositions are well known
or will be apparent to those skilled in the art and are
described in more detail in, for example, Remington's
Pharmaceutical Science, 15th ed., Mack Publishing
Company, Easton, Pennsylvania.
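
As a rough illustration of the arithmetic behind the intravenous example, the total antibody in an infusion is the volume multiplied by the concentration. The sketch below is not part of the specification; the function name is hypothetical, and only the 250 ml volume and the mg/ml ranges are taken from the text.

```python
def total_antibody_mg(volume_ml: float, conc_mg_per_ml: float) -> float:
    """Total antibody mass (mg) = volume (ml) x concentration (mg/ml)."""
    if volume_ml < 0 or conc_mg_per_ml < 0:
        raise ValueError("volume and concentration must be non-negative")
    return volume_ml * conc_mg_per_ml


VOLUME_ML = 250  # infusion volume stated in the text

# Concentration ranges (mg/ml) stated in the text.
RANGES = {"broad": (1, 75), "preferred": (5, 50)}

for label, (low, high) in RANGES.items():
    print(label,
          total_antibody_mg(VOLUME_ML, low), "to",
          total_antibody_mg(VOLUME_ML, high), "mg")
```

The same function can be applied to any other volume or concentration; it makes no statement about what dose is appropriate.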

It is preferred that the therapeutic and
prophylactic agents of the invention, when in a
pharmaceutical preparation, be present in unit dose
forms. The appropriate therapeutically effective dose
can be determined readily by those of skill in the art.
To effectively treat an inflammatory disorder in a human
or other animal, one dose of approximately 0.1 mg to
approximately 20 mg per 70 kg body weight of a protein
or an antibody of this invention should be administered
parenterally, preferably i.v. or i.m. (intramuscularly).
Such dose may, if necessary, be repeated at appropriate
time intervals selected as appropriate by a physician.

The altered antibodies and engineered antibodies of
this invention may also be used in diagnostic regimens,
such as for the determination of RSV mediated disorders
or tracking progress of treatment of such disorders. As
diagnostic reagents, these altered antibodies may be
conventionally labeled for use in ELISAs and other
conventional assay formats for the measurement of RSV
levels in serum, plasma or other appropriate tissue, or
the release by human cells in culture. The nature of
the assay in which the altered antibodies are used is
conventional and does not limit this disclosure.

The antibodies, altered antibodies or fragments
thereof described herein can be lyophilized for storage
and reconstituted in a suitable carrier prior to use.
This technique has been shown to be effective with
conventional immunoglobulins and art-known
lyophilization and reconstitution techniques can be
employed.

The following examples illustrate various aspects
of this invention including the construction of
exemplary engineered antibodies and expression thereof
in suitable vectors and host cells, and are not to be
construed as limiting the scope of this invention. All
amino acids are identified by conventional three letter
or single letter codes. All necessary restriction
enzymes, plasmids, and other reagents and materials were
obtained from commercial sources unless otherwise
indicated. All general cloning ligation and other
recombinant DNA methodology were as performed in T.
Maniatis et al., cited above, or Sambrook et al., cited
above.

Example 1: Isolation of Gλ-1 scFv

Single chain (sc) Fv libraries were prepared from
an individual purposely exposed to RSV and selected
against recombinant RSV F-protein following described
procedures [R. H. Jackson et al., in Protein Engineering:
A Practical Approach, A. R. Rees et al., eds., Oxford
University Press, chapter 12, pp. 277-301, 1992; H. R.
Hoogenboom et al., Nucl. Acid Res., 19: 4133-4137
(1991); J. D. Marks et al., J. Mol. Biol., 222: 581-597
(1991)]. Briefly, lymphocytes were isolated from a
blood sample taken 15 days post exposure. RNA isolated
from the lymphocytes was used for preparation of scFv
encoding repertoires for phage display. Sets of V-region
primers were paired with constant region primers
for heavy chain domain 1 IgG and IgM and light chain Cκ
and Cλ and then linked in a scFv VH-VL orientation with
a 15 amino acid spacer (glycine4-serine)3 [SEQ ID NO: 21]
by overlap PCR [see J. D. Marks et al., cited above, for
description of the primers].

The resulting four scFv repertoires (Vκ with IgG
and IgM, Vλ with IgG and IgM) were cloned into a
phagemid vector similar to pHEN1 [H. R. Hoogenboom et
al., cited above] resulting in fusion of the scFvs to
gene III of phage fd. The vector was then transformed
into E. coli (e.g., strain TG1) by electroporation to
yield the corresponding phagemid libraries.

Phage libraries displaying the scFv-gene 3 fusions
were prepared by infection of each of the plasmid
libraries with the M13K07 helper phage [R. H. Jackson,
cited above] and were individually subjected to 2 rounds
of panning against recombinant F-protein coated onto
plastic. In the first round, 10^11 phage in 2.5 ml
phosphate buffered saline (PBS)/2% Marvel™ non-fat dry
milk were incubated for 90 minutes in a tube coated with
5 µg/ml of F-protein [described in P. Tsui et al., J.
Immunol., 157: 772-780 (1996)] followed by 1 wash with
10x PBS/0.05% Tween 20 and a second wash with 10x PBS
alone. Bound phage were eluted with 10 mM triethylamine
and the eluate was neutralized with 1 M Tris-HCl, pH
7.4. The eluted phage were amplified and subjected to a
similar second round of panning, except that the
concentration of F-protein for coating was 2 µg/ml and
the wash buffer contained 20x PBS.

E. coli were infected with the eluted phage and 96
colonies from each starting library were superinfected
with helper phage and screened for F-protein binding
activity. Only four positive clones were obtained from
the 2 IgM libraries, whereas 41 positives were observed
for the IgG libraries. By partial sequence analysis,
all of the clones carried one of three different heavy
chains. Complete sequences were obtained for the heavy
and light chain V-regions for six clones, all from the
IgG libraries.
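
The screening yield reported above can be expressed as a hit rate per library class. The following is a minimal sketch that assumes, as the text implies, that 96 colonies were screened from each of the two IgM and each of the two IgG libraries and that the positives are pooled per class; the function name is hypothetical.

```python
def hit_rate(positives: int, colonies_per_library: int, n_libraries: int) -> float:
    """Fraction of screened colonies scoring positive for F-protein binding."""
    screened = colonies_per_library * n_libraries
    if screened <= 0:
        raise ValueError("at least one colony must be screened")
    return positives / screened


# Counts from the text: 96 colonies per library, two libraries per isotype.
igm_rate = hit_rate(positives=4, colonies_per_library=96, n_libraries=2)
igg_rate = hit_rate(positives=41, colonies_per_library=96, n_libraries=2)

print(f"IgM libraries: {igm_rate:.1%}")
print(f"IgG libraries: {igg_rate:.1%}")
```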

Serial dilutions of titered phage stocks of each of
these six clones were tested by ELISA for binding to
recombinant F-protein and to RSV infected cell lysate.
All showed binding to F-protein with the phage
designated Gλ-1 showing the best activity. However,
Gλ-1 and three other clones showed little binding to the
RSV lysate.

Three clones: Gλ-1, Gλ-3 (lysate binding
positive), and Gκ-1 (lysate binding negative), where "κ"
and "λ" designate the class of the light chain, were
characterized further for competition of their binding
by F-protein specific neutralizing monoclonal
antibodies, and their ability to inhibit virus
infection. The neutralizing mAbs RSV19 and B4 described
in International patent publication No. WO92/04381,
published March 19, 1992, and International patent
publication No. WO93/20210, published October 14, 1993,
recognize distinct epitopes on the F-protein. Gκ-1 was
strongly inhibited by both antibodies. Gλ-1 was
significantly inhibited by B4 only. Gλ-3 was not
inhibited by either antibody (shown for Gλ-1 only; see
Figs. 1A and 1B). In initial assays (Table I,
experiments 1-3), all three clones showed neutralizing
activity in vitro, with Gλ-1 being the most potent (Fig.
2, a graph of experiment 2), while control wild-type
phage (M13K07) not displaying scFv had no effect.

To address the possibility that neutralization
might result just from phage coating of virus,
irrespective of epitope, a phage preparation of the
non-neutralizing Fab 5-16 was tested in the same assay. In
three out of four assays, this preparation also showed
good neutralization activity, as did the control phage
in two of these assays (Table I, experiments 4-7). This
confounding observation of variable neutralization by
both Fab 5-16 and control M13K07 phage rendered the
viral neutralization studies inconclusive.

Table I

Virus Neutralization (IC50 x 10 ') 1 (aru or kru/ml) 2
Experiment # 1 2 3 4 5 6 7

(Values are listed per row in source order; their column
positions within experiments 1-7 are not recoverable.)

Phage         Values

Gκ-1    a     1,600   <300
        b     <10   <7

Gλ-1    a     80   <300
        b     8.1   11
        c     120

Gλ-3    a     900   <300   180
        b     10
        c     730

M13K07  a     >10 b   >10 b   >5,000
        b     +all dil.   +all dil.   >10'

Fab 5-19 a    >10 b   40   180
        b     3.5


Legend:

1 Assay according to M. J. Cannon, J. Virol. Meth.,
16: 293-301. Virus at 100 infectious centers/well
was incubated with dilutions of the indicated phage
for 1 hr and then added to susceptible cells for 3
hr. The virus/phage solution was aspirated and
replaced with fresh medium and the cells were
incubated overnight before peroxidase staining for
virus infected cells.

2 aru = ampicillin resistance units, a measure of
phagemid containing particles.
kru = kanamycin resistance units, a measure of
particles containing the phage genome (for the
M13K07 control only).

In the face of these results, made more ambiguous
by the dependence of all assays on phage stocks rather than
antibody proteins of known concentration, Gλ-1 was
selected as the most likely candidate for a potent
neutralizing antibody based on (1) its apparent better
binding to F-protein, (2) its selective inhibition of
binding by the B4 antibody, and (3) its suggested
activity over background in the virus neutralization
assay.

Example 2: Conversion of Gλ-1 scFv to mAb Version A

The DNA and encoded protein sequences of the VL and
VH regions of Gλ-1 are shown in Figs. 3 [SEQ ID NOS: 1
and 2] and 4 [SEQ ID NOS: 3 and 4], respectively. For
expression in mammalian cells, the heavy chain variable
region and the light chain variable region from the Gλ-1
plasmid were cloned into derivatives of plasmid pCDN
[Nambi, A. et al., Mol. Cell. Biochem., 131: 75-86
(1994)] in which the expression of the antibody chain is
driven by the cytomegalovirus (CMV) promoter.
Plasmid pCD-HC68B is used for expressing full length
heavy chains and plasmid pCN-HuLC for expressing full
length light chains.

In the initial constructs, changes in the sequence
at the amino terminus were introduced by the PCR primers
used for cloning the light chain and heavy chain
variable regions from plasmid Gλ-1. In these
constructs, the peptide signal sequence for both the
heavy and light chains is derived from the Campath light
chain [M. J. Page et al., Biotechnology 9: 64-68
(1991)]. The heavy chain of Gλ-1 was PCR amplified from
Gλ-1 phagemid DNA, using primers for the amino terminus
and framework 4 of the variable region. The resulting
PCR fragment was cut with XhoI (site introduced by the
amino terminus primer) and BstEII (naturally occurring
site in framework 4), and cloned into an intermediate
vector, F4HCV, at the XhoI/BstEII sites.

This cloning grafted the variable region of Gλ-1
onto the constant region of another anti-RSV heavy chain
194-F4 [cloned at SmithKline Beecham from a human
hybridoma]. This intermediate clone was cut with XhoI
and Bsp120I, and introduced into the same sites in
pCD-HC68B. The XhoI site is introduced at the amino
terminus by the PCR primer and, when cloned into
pCD-HC68B at the same site, is preceded in frame by the
Campath leader sequence. The Bsp120I site is a
naturally occurring, highly conserved sequence at the
beginning of the CH1 domain, and when cloned into
pCD-HC68B at the same site, is in frame with the remaining
sequence for the CH1 through CH3 regions of human IgG1.
In the resulting construct, Gλ-1Apcd (Figs. 8A-8F [SEQ
ID NO: 13]), the amino acids immediately following the
Campath leader are EVQLLE [SEQ ID NO: 17], where the
residues LE are encoded by the nucleotide sequence for
the XhoI cloning site.

The light chain of Gλ-1 was PCR amplified from the
Gλ-1 phagemid DNA, using primers for the amino terminus
and framework 4 of the variable region. The resulting
PCR fragment was cut with SacI (site introduced by the
amino terminus primer) and AvrII (naturally occurring
site in framework 4), and cloned into 43-1pcn at the
SacI/AvrII sites. This cloning grafted the variable
region of Gλ-1, in frame, onto the constant region of
another anti-RSV lambda light chain 43 [P. Tsui et al.,
J. Immunol., 157: 772-780 (1996)], which had been cloned
at SmithKline Beecham from a combinatorial library
derived from RNA isolated from human spleen. The SacI
site is introduced at the amino terminus by the PCR
primer and, when cloned into 43pcn at the same site, is
preceded in frame by the Campath leader sequence. The
first two amino acids of the mature light chain are
therefore deleted. In the resulting construct, Gλ-1Apcn
(Figs. 9A-9E [SEQ ID NO: 14]), the first two amino acids
immediately following the leader are EL, where the
residues EL are encoded by the nucleotide sequence for
the SacI cloning site.

The nucleotide sequences of the plasmids Gλ-1Apcd
and Gλ-1Apcn are shown in Figs. 8A-8F [SEQ ID NO: 13]
and 9A-9E [SEQ ID NO: 14], respectively. This set of
vectors was used to produce antibody Gλ-1A in COS cells
and in CHO cells.


Example 3: Cloning of the Corrected Gλ-1 Heavy and Light
Chains

In cloning the variable region of the Gλ-1 heavy
chain from the single chain Fv (scFv) format into the
full length format, the fifth amino acid at the amino
terminus was changed from Val to Leu, for cloning
purposes. To correct this change, PCR primers were
designed for the amino terminus of the Gλ-1 heavy chain
cloned into pCD, which reverted the fifth amino acid
back to Val. The correction was introduced via the PCR
overlap technique using the correction primers and
primers annealing to sequences within the CMV promoter
and the CH2 constant region as the outside 5' and 3'
primers, respectively. The final PCR product was
digested with restriction enzymes EcoRI and Bsp120I,
and cloned into the Gλ-1Apcd vector at the same sites to
create Gλ-1Bpcd.

The final construct was sequenced to verify that
the amino terminus of the heavy chain had been corrected
from EVQLLE [SEQ ID NO: 17] to EVQLVE [SEQ ID NO: 18]
(see Fig. 6). The nucleotide sequence of coding region
for the corrected heavy chain, Gλ-1B, is shown in Figs.
10A-10B [SEQ ID NO: 15].
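
A sequence check of this kind amounts to comparing the two amino-terminal hexapeptides position by position. The sketch below uses a hypothetical helper, not the sequencing workflow actually used, and reports 1-based positions at which a reference and a variant differ.

```python
def substitutions(reference: str, variant: str) -> list[tuple[int, str, str]]:
    """List (position, reference residue, variant residue) for each mismatch.

    Positions are 1-based; both sequences must have the same length.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        (pos, ref, var)
        for pos, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


CORRECTED = "EVQLVE"   # SEQ ID NO: 18, version B (matches the scFv)
VERSION_A = "EVQLLE"   # SEQ ID NO: 17, version A

print(substitutions(CORRECTED, VERSION_A))
```

The single mismatch at position 5 (Val in the corrected chain, Leu in version A) agrees with the description of the fifth amino acid being changed for cloning purposes.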

In cloning the variable region of the Gλ-1 light
chain from the scFv format into the full length format,
changes were introduced at the amino terminus for
cloning purposes. Specifically, the first 2 amino acids
(Gln and Ser) of the light chain were deleted and the
third amino acid was changed from Val to Glu. To
correct these changes, PCR primers were designed for the
amino terminus of the Gλ-1 light chain cloned into pCN,
which replaced the two deleted amino acids (Gln and Ser)
and reverted the third amino acid back to Val. The
corrections were introduced via the PCR overlap
technique using the correction primers and primers
annealing to sequences within the CMV promoter and the λ
constant region as the outside 5' and 3' primers,
respectively. The final PCR product was digested with
restriction enzymes EcoRI and AvrII and cloned into the
Gλ-1Apcn vector at the same sites to create Gλ-1Bpcn.

The final construct was sequenced to verify that
the amino terminus of the light chain had been corrected
from --EL to QSVL (amino acids 1-4 of SEQ ID NO: 10).

The nucleotide sequence of coding region for the
corrected light chain, Gλ-1B, is shown in Fig. 11 [SEQ
ID NO: 16]. This vector, Gλ-1Bpcn, was used with
Gλ-1Bpcd to produce antibody Gλ-1B in COS cells and in CHO
cells.

Example 4: Production of Gλ-1 mAbs in Mammalian Cells

For initial characterization, the mAb constructs
for each version, Gλ-1A heavy and light chain, Gλ-1B
heavy and light chain, were expressed in COS cells
essentially as described in Current Protocols in
Molecular Biology, eds. F. M. Ausubel et al., 1988, John
Wiley & Sons, vol. 1, section 9.1. On day 1 after the
transfection, the culture growth medium was replaced
with a serum-free medium [SmithKline Beecham] which was
changed on day 3. Similar satisfactory results are
obtained using a publicly available medium, DMEM
supplemented with ITS™ Premix, an insulin, transferrin,
selenium mixture (Collaborative Research, Bedford, MA)
and 1 mg/ml bovine serum albumin (BSA).

The mAb was prepared from the day 3 + day 5
conditioned medium by standard protein A affinity
chromatography methods (e.g., as described in Protocols
in Molecular Biology) using, for example, Prosep A
affinity resin (Bioprocessing Ltd., UK).

To produce larger quantities of the Gλ-1B mAb
(100-200 mg), the vectors were introduced into a proprietary
CHO cell system. However, similar results will be
obtained using dhfr- CHO cells as previously described
[P. Hensley et al., J. Biol. Chem., 269: 23949-23958
(1994)]. Briefly, a total of 30 µg of linearized
plasmid DNA (15 µg each of the A or B set of heavy chain
and light chain vectors) is electroporated into 1x10^7
cells. The cells are initially selected in
nucleoside-free medium in 96 well plates. After three to four
weeks, media from growth positive wells is screened for
human immunoglobulin using an ELISA assay. The highest
expressing colonies are expanded and selected in
increasing concentrations of methotrexate for
amplification of the transfected vectors. The antibody
is purified from conditioned medium by standard
procedures using protein A affinity chromatography
(Protein A sepharose, Pharmacia) followed by size
exclusion chromatography (Superdex 200, Pharmacia).

The concentration and the antigen binding activity
of the eluted antibody are measured by ELISA. The
antibody containing fractions are pooled and further
purified by size exclusion chromatography. As expected
for any such antibody, by SDS-PAGE, the predominant
protein product migrated at approximately 150 kd under
non-reducing conditions and as two bands of 50 and 25 kd
under reducing conditions. For antibody produced in CHO
cells, the purity was >90%, as judged by SDS-PAGE, and
the concentration was accurately determined by amino
acid analysis.



Example 5: Binding of the Gλ-1 mAbs to recombinant F
protein

Binding of the Gλ-1 mAbs to recombinant F protein
was measured in a standard solid phase ELISA. Antigen
diluted in PBS pH 7.0 was adsorbed onto polystyrene
round-bottom microplates (Dynatech, Immunolon II) for 18
hours. Wells were then aspirated and blocked with 0.5%
boiled casein (BC) in PBS containing 1% Tween 20
(PBS/0.5% BC) for two hours. Antibodies (50 µl/well)
were diluted to varying concentrations in PBS/0.5% BC
containing 0.025% Tween 20 and incubated in antigen
coated wells for one hour. Plates were washed three
times with PBS containing 0.05% Tween 20, using a
Titertek 320 microplate washer, followed by addition of
HRP-labelled protein A/G (50 µl/well) diluted 1:5000. After
washing three times, TMBlue substrate (TSI, #TM102) was
added and plates were incubated an additional 15
minutes. The reaction was stopped by addition of 1 N
H2SO4 and absorbance read at 450 nm using a Biotek ELISA
reader.

The antigen binding epitope of the Gλ-1 mAbs was
examined in a competition ELISA. The Gλ-1 mAbs were
mixed with increasing concentrations of RSMU19 or B4,
two potent neutralizing mAbs [Tempest et al., Biotech.,
9: 266-271 (1991); Kennedy et al., J. Gen. Virol., 69:
3023-3032 (1988)] and added to F protein-coated wells.
The epitope regions recognized by mAbs RSMU19 and B4 are
quite distinct from each other as previously described
in Arbiza et al., J. Gen. Virol., 73: 2225-2234 (1992).
The concentration of the Gλ-1 mAbs used in competition
studies was determined previously to give 90% maximal
binding to F antigen. Binding of the Gλ-1 mAbs in the
presence of other mAbs was detected using HRP-labelled
goat anti-human IgG. The reaction was developed as
stated above.
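
Competition data of this kind are commonly summarized as percent inhibition of the signal obtained without competitor. The sketch below shows one conventional way to compute it; the formula, the function name and the absorbance readings are assumptions for illustration and are not taken from the specification.

```python
def percent_inhibition(a_with_competitor: float,
                       a_without_competitor: float,
                       a_blank: float = 0.0) -> float:
    """Percent reduction in blank-corrected absorbance caused by a competitor."""
    uninhibited = a_without_competitor - a_blank
    if uninhibited <= 0:
        raise ValueError("uninhibited signal must exceed the blank")
    inhibited = a_with_competitor - a_blank
    return 100.0 * (1.0 - inhibited / uninhibited)


# Hypothetical A450 readings, chosen only to show the calculation.
print(percent_inhibition(a_with_competitor=0.2, a_without_competitor=1.0))
print(percent_inhibition(a_with_competitor=1.0, a_without_competitor=1.0))
```

A value near 100 indicates that the competitor blocks nearly all binding, and a value near 0 indicates no competition.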

The Gλ-1 mAbs demonstrated potent binding to
recombinant F (rF) protein by ELISA (EC50 for mAb B = 2.6
ng/ml). Binding of the Gλ-1 mAbs to rF protein was
inhibited by mAb B4, for which the F protein amino acids
critical for antigen recognition are amino acids 268,
272 and 275 of SEQ ID NO: 20. Binding of the Gλ-1 mAbs
to rF protein was not inhibited by mAb RSMU19, for which
F protein amino acid 429 of SEQ ID NO: 20 is critical
for antigen recognition. These results indicate that
residues in the region of amino acids 255-275 of the F
protein [SEQ ID NO: 20] are critical for Gλ-1 mAb
recognition.

Example 6: In vitro Fusion-Inhibition Activity of the
Gλ-1 mAbs

The ability of the Gλ-1 mAbs to inhibit
virus-induced cell fusion was determined using a modification
of the in vitro microneutralization assay [Beeler et
al., J. Virol., 63: 2941-2950 (1989)]. In this assay, 50
µl of RS Long strain virus (10-100 TCID50/well) [American
Type Culture Collection ATCC VR-26] were mixed with 0.1
ml VERO cells (5x10^3/well) [ATCC CCL-81] in Minimum
Essential Media (MEM) containing 2% fetal calf serum
(FCS), for 4 hours at 37°C, 5% CO2. Serial two-fold
dilutions (in quadruplicate) of mAb (50 µl) were then
added to wells containing virus-infected cells. Control
cultures contained cells incubated with virus only
(positive virus control) or cells incubated with media
alone.

Cultures were incubated at 37°C in 5% CO2 for 6 days
at which time cytopathic effects (CPE) in virus control
wells were >90%. Microscopic examination for
cytopathic effects was confirmed by ELISA. Media was
aspirated from cultures and replaced with 50 µl of 90%
methanol containing 0.6% H2O2. After 10 minutes,
fixative was aspirated and plates were air dried
overnight. Viral antigen was detected in the fixed
cultures using 1 µg/ml biotinylated RSCHB4 (a human Fc
derivative of the bovine B4 mAb [SmithKline Beecham]),
followed by HRP-labelled streptavidin
(Boehringer-Mannheim) diluted 1:10,000. The reaction was developed
using TMBlue and stopped by addition of 1 N H2SO4.
Absorbance was measured at 450 nm (O.D.450).

Fusion-inhibition titers were defined as the
concentration of antibody which caused a 50% reduction
in ELISA signal (ED50) as compared to virus controls.
Based on the curve generated in the ELISA by the
standard virus titration, a 50% reduction in O.D.450
corresponded to >90% reduction in virus titer.
Calculation of the 50% point was based on regression
analysis of the dose titration.
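
The specification does not state which regression model was used. As one plausible sketch, the 50% point can be estimated by fitting the ELISA signal (as percent of virus control) against log10 antibody concentration with ordinary least squares and solving for 50%. The function names and the titration data below are hypothetical.

```python
import math


def two_fold_series(top: float, n: int) -> list[float]:
    """Serial two-fold dilutions starting from the top concentration."""
    return [top / (2 ** i) for i in range(n)]


def ed50_by_regression(concs: list[float], percent_signal: list[float]) -> float:
    """Estimate the concentration giving 50% signal.

    Fits percent_signal = intercept + slope * log10(conc) by least squares
    (an assumed log-linear model) and solves for a signal of 50.
    """
    xs = [math.log10(c) for c in concs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(percent_signal) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, percent_signal))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return 10 ** ((50.0 - intercept) / slope)


# Hypothetical titration: signal rises as antibody is diluted out.
concs = two_fold_series(4.0, 5)      # 4, 2, 1, 0.5, 0.25 (arbitrary units)
signal = [10.0, 30.0, 50.0, 70.0, 90.0]
print(ed50_by_regression(concs, signal))
```

A log-linear fit is only reasonable over the roughly linear middle of a titration curve; a four-parameter logistic fit is a common alternative when the plateaus are included.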

The Gλ-1 mAbs demonstrated potent in vitro
fusion-inhibition activity against type A RS Long strain virus
(ED50 for mAb B of 0.51 ± 0.38 µg/ml). In this in vitro
fusion-inhibition assay, Gλ-1 mAb B was more active than
the humanized mAb RSHZ19 (ED50 of 0.4-3.0 µg/ml) [Wyde et
al., Pediatr. Res., 38(4): 543-550] in comparative
assays.

Example 7: In vivo Activity of Gλ-1 mAb B: Prophylaxis
and Therapy in Balb/c Mouse Model

Balb/c mice (5/group) were inoculated
intraperitoneally with doses ranging from 0.06 mg/kg to
5 mg/kg of Gλ-1 mAb B either 24 hours prior
(prophylaxis) or 4 days after (therapy) intranasal
infection with 10^5 PFU of the A2 strain of human RSV.
Mice were sacrificed 5 days after infection. Lungs were
harvested and homogenized to determine virus titers.

Virus was undetectable in the lungs of mice treated
with ≥1.25 mg/kg Gλ-1 mAb B either
prophylactically or therapeutically. See Table II
below. Significant viral clearance (2-3 log10) was also
achieved in animals receiving 0.31 mg/kg Gλ-1 mAb B
either prophylactically or therapeutically.

Table II: Gλ-1 mAb B Prophylaxis and Therapy in Balb/c
Mice

                Dose       Lung Virus Titer (log10/g lung)
Treatment       (mg/kg)    Prophylaxis     Therapy

Gλ-1 mAb B      5          <1.7            <1.7
                1.25       <1.7            <1.7
                0.31       1.8 ± 0.3       2.9 ± 0.4
                0.06       4.3 ± 0.7       4.5 ± 0.3

PBS             -          4.8 ± 0.7       4.7 ± 0.2
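
The log10 reductions relative to the PBS control can be read off Table II by subtraction. The sketch below is illustrative only: the data layout and function name are hypothetical, the mean titers are those in the table, and the standard deviations are ignored. Entries reported as <1.7 are below the detection limit, so only a lower bound on the reduction can be given for them.

```python
# Mean lung virus titers (log10/g lung) from Table II.
PBS_CONTROL = {"prophylaxis": 4.8, "therapy": 4.7}
TREATED = {
    0.31: {"prophylaxis": 1.8, "therapy": 2.9},
    0.06: {"prophylaxis": 4.3, "therapy": 4.5},
}
DETECTION_LIMIT = 1.7  # titers reported as "<1.7" at 1.25 and 5 mg/kg


def log_reduction(control_titer: float, treated_titer: float) -> float:
    """Difference in log10 titer between control and treated animals."""
    return round(control_titer - treated_titer, 1)


for dose, titers in TREATED.items():
    for regimen, titer in titers.items():
        print(dose, regimen, log_reduction(PBS_CONTROL[regimen], titer))

# For undetectable virus, the reduction is at least control minus the limit.
for regimen, control in PBS_CONTROL.items():
    print("undetectable", regimen, ">", log_reduction(control, DETECTION_LIMIT))
```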

The Gλ-1 mAbs have potent antiviral activity
in vitro against a broad range of native RSV isolates of
both type A and B, and show prophylactic and therapeutic
efficacy in vivo in animal models. Thus, the Gλ-1 mAbs
are candidates for therapeutic, prophylactic, and
diagnostic application in man.

Numerous modifications and variations of the
present invention may be made by one of skill in the art
in view of the invention described herein. Such
modifications are believed to be encompassed by the
specification and claims of the present invention. All
references cited above are incorporated by reference
herein.

1. A human monoclonal antibody and
functional fragments thereof, specifically reactive with
an F protein epitope of Respiratory Syncytial Virus and
capable of neutralizing infection by said virus selected
from the group consisting of Gλ-1A and Gλ-1B.

2. The monoclonal antibody according to
Claim 1 which comprises the light chain amino acid
sequence of Fig. 3 SEQ ID NO: 2 and the heavy chain
amino acid sequence of Fig. 4 SEQ ID NO: 4.

3. The monoclonal antibody according to
Claim 1 which comprises the light chain amino acid
sequence encoded by the DNA sequence of Fig. 11 SEQ ID
NO: 16 and the heavy chain amino acid sequence encoded
by the DNA sequence of Figs. 10A-10B SEQ ID NO: 15.

4. The monoclonal antibody according to
Claim 1 wherein said fragment is selected from the group
consisting of Fv, Fab and F(ab')2.

5. An isolated nucleic acid molecule
selected from the group consisting of:

(a) a nucleic acid sequence encoding any
of the human monoclonal antibodies, altered antibodies
and CDRs of any of the claims 1-4;

(b) a nucleic acid complementary to any
of the sequences in (a); and

(c) a nucleic acid sequence of 18 or more
nucleotides capable of hybridizing to the CDRs of any of
claims 1-4 under stringent conditions.

6. The isolated nucleic acid molecule
according to Claim 5 comprising the sequences of Figs.
8A-8F and 9A-9E SEQ ID NOS: 13 and 14, or Figs. 10A-10B
and 11 SEQ ID NOS: 15 and 16.

7. A recombinant plasmid comprising the
nucleic acid sequences of any of Claims 5 or 6.

8. A host cell comprising the plasmid of
Claim 7.

9. A process for the production of a human
antibody specific for RSV comprising culturing the host
cell of Claim 8 in a medium under suitable conditions of
time, temperature and pH and recovering the antibody so
produced.

10. A method of detecting RSV comprising
contacting a source suspected of containing RSV with a
diagnostically effective amount of the monoclonal
antibody of Claim 1 and determining whether the
monoclonal antibody binds to the source.

11. A method for providing passive
immunotherapy to RSV disease in a human, comprising
administering to the human an immunotherapeutically
effective amount of the monoclonal antibody of Claim 1.

12. The method according to Claim 11 wherein
the passive immunotherapy is provided prophylactically.

13. A pharmaceutical composition comprising
at least one dose of an immunotherapeutically effective
amount of the monoclonal antibody of Claim 1 in a
pharmaceutically acceptable carrier.

14. A pharmaceutical composition comprising
at least one dose of an immunotherapeutically effective
amount of the monoclonal antibody of Claim 1 in
combination with at least one additional monoclonal
antibody.

15. The pharmaceutical composition according
to Claim 14 wherein said additional monoclonal antibody
is an anti-RSV antibody distinguished from the antibody
of Claim 1 by virtue of being reactive with a different
epitope of the RSV F protein antigen.
Fig. 1A

RSV19/GI1 scFv phage competition
[Plot not reproduced; x-axis: 0.00E+00 to 1.00E+09 a.r.u.]

B4/GI1 scFv phage competition
[Plot not reproduced; x-axis: 0.00E+00 to 1.00E+09]
Fig. 2

Neutralisation of RS/V/273 with phage Fv
[Plot not reproduced; groups: G lambda 3, G lambda 1, G kappa 1]
FIGURE 3

1 CAGTCTGTGTTGACGCAGCCGCCCTCAGTCTCTGCGGCCCCAGGACAGAA 50
QSVLTQPPSVSAAPGQK

51 GGTCACCATCTCCTGCACTGGGAGCAGCTCCAACCTCGGGGCAGGTTATG 100
VTISCTGSSSNLGAGYD

101 ATGTTCACTGGTACCGGCAACTTCCAGGGACAGCCCCCAAACTCCTCATC 150
VHWYRQLPGTAPKLLI

151 TATGATAACAACAATCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTC 200
YDNNNRPSGVPDRFSGS

201 CAAGTCTGGCCCCTCAGCCTCCCTGGCCATCTCTGGGCTCCAGGCTGAGG 250
KSGPSASLAISGLQAED

251 ATGAGGCTGATTATTACTGCCAGTCCTATGACAGCAGCCTGAATGGTTAT 300
EADYYCQSYDSSLNGY

301 GTCTTCGGAACTGGGACCCAGCTCACCGTCCTAGGT
VFGTGTQLTVLG
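
The pairing of nucleotide rows with one-letter amino acid rows in this figure can be checked by translating the DNA in reading frame 1. The sketch below is illustrative only and is not part of the application; it assumes the standard genetic code, and the names `translate` and `FIG3_ROW1` are hypothetical helpers introduced here.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 1-50 of the light chain variable region shown in Figure 3.
FIG3_ROW1 = "CAGTCTGTGTTGACGCAGCCGCCCTCAGTCTCTGCGGCCCCAGGACAGAA"

if __name__ == "__main__":
    # Prints the first 16 residues; the 17th residue (K) needs the next row.
    print(translate(FIG3_ROW1))
```

Because each 50-nucleotide row is not a multiple of three, the last residue printed under a row in the figure spans the row boundary; translating the concatenated rows avoids that split.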
1 GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTACAGCCTGGGGGGTC 50
EVQLVESGGGLVQPGGS

51 CCTGAGACTCTCCTGCGCAGCCTCTGGAGTCTCCCTCAGTGGATACAAGA 100
LRLSCAASGVSLSGYKM

101 TGAACTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAATGGGTCTCTTCC 150
NWVRQAPGKGLEWVSS

151 ATTACTGGTATGAGTAATTACATACACTACTCAGACTCAGTGAAGGGCCG 200
ITGMSNYIHYSDSVKGR

201 ATTCACCATCTCCAGAGACAACGCCATGAACTCACTGTATCTGCAAATGA 250
FTISRDNAMNSLYLQMN

251 ACAGCCTGACAGCCGAGGACACGGGTGTTTATTATTGTGCGACACAACCG 300
SLTAEDTGVYYCATQP

301 GGGGAGCTGGCGCCTTTTGACCATTGGGGCCAGGGAACCCTGGTCACCGT 350
GELAPFDHWGQGTLVTV

351 CTCCTCA
SS
Figure 5
FIGURE 6

Comparison of the Heavy Chain Amino Acid Sequences of the
Gλ-1 single chain Fv and mAbs

Leader and Variable Regions

GL Dp58: EVQLVESGGGLVQPGGSLRLSCAASGFTFS

Gλ-1 scFv: VSL-

Gλ-1A: MGWSCIILFLVATATGVHS L

Gλ-1B: V

CDR1 CDR2

GL Dp58: SYEMNWVRQAPGKGLEWVSYISSSGSTIYYADSVKGRFTISRDNAKNSLY

Gλ-1 scFv: G-K S-TGMSNY-H-S M

Gλ-1A:

Gλ-1B:

CDR3

GL Dp58: LQMNSLRAEDTAVYYCAR

Gλ-1 scFv: T G TQPGELAPFDHWGQGTLVTVSS

Gλ-1A:

Gλ-1B:
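
The residues shown for the scFv in this comparison are the positions where it departs from the Dp58 germline sequence. As a rough illustration only, the sketch below lists those substitutions position by position; the two strings are transcribed from the germline rows of this figure and from the heavy chain variable region translation given earlier, and the function name `substitutions` is a hypothetical helper. It assumes an ungapped alignment starting at residue 1.

```python
# Germline Dp58 heavy chain segment (98 residues), transcribed from Figure 6.
GERMLINE_DP58 = (
    "EVQLVESGGGLVQPGGSLRLSCAASGFTFS"
    "SYEMNWVRQAPGKGLEWVSYISSSGSTIYYADSVKGRFTISRDNAKNSLY"
    "LQMNSLRAEDTAVYYCAR"
)

# scFv heavy chain variable region (119 residues), from the translated sequence.
SCFV_VH = (
    "EVQLVESGGGLVQPGGSLRLSCAASGVSLS"
    "GYKMNWVRQAPGKGLEWVSSITGMSNYIHYSDSVKGRFTISRDNAMNSLY"
    "LQMNSLTAEDTGVYYCATQPGELAPFDHWGQGTLVTVSS"
)


def substitutions(reference: str, query: str):
    """Return (position, reference residue, query residue) for each mismatch.

    Positions are 1-based; comparison stops at the end of the shorter sequence.
    """
    return [
        (pos, ref, alt)
        for pos, (ref, alt) in enumerate(zip(reference, query), start=1)
        if ref != alt
    ]


if __name__ == "__main__":
    for pos, ref, alt in substitutions(GERMLINE_DP58, SCFV_VH):
        print(f"{ref}{pos}{alt}")
```

The comparison covers only the 98 germline-encoded positions; the remaining scFv residues (the end of CDR3 and framework IV) have no germline counterpart in this figure.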
FIG 7

Comparison of the Light Chain Amino Acid Sequences of the
Gλ-1 single chain Fv and mAbs

Leader and Variable Regions

CDR1

GL DpL8: QSVLTQPPSVSGAPGQRVTISCTGSSSNIG

Gλ-1 scFv: A K L-

Gλ-1A: MGWSCIILFLVATATGVHS E

Gλ-1B: QSV

CDR2

GL DpL8: AGYDVHWYQQLPGTAPKLLIYGNSNRPSGVPDRFSGSKSGTSASLAITGL

Gλ-1 scFv: R D-N P S -

Gλ-1A:

Gλ-1B:

CDR3

GL DpL8: QAEDEADYYC

Gλ-1 scFv: QSYDSSLNGYVFGTGTQLTVLG

Gλ-1A:

Gλ-1B:
FIGURE 8A

1 gacgtcgcggccgctctaggcctccaaaaaagcctcctcactacttctgg 

51 aatagctcagaggccgaggcggcctcggcctctgcataaataaaaaaaat 

101 tagtcagccatgcatggggcggagaatgggcggaactgggcggagttagg 

151 ggcgggatgggcggagttaggggcgggactatggttgctgactaattgag 

2 01 atgcatgctttgcatacttctgcctgctggggagcctggggactttccac 

2 51 acctggttgctgactaattgagatgcatgctttgcatacttctgcctgct 

3 01 ggggagcctggggactttccacaccctaactgacacacattccacagaat 
3 51 taattcccggggatcgatccgtcgacgtacgactagttattaatagtaat 
401 caattacggggtcattagttcatagcccatatatggagttccgcgttaca 
451 taacttacggtaaatggcccgcctggctgaccgcccaacgacccccgccc 
5 01 attgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt 

5 51 tccattgacgtcaatgggtggactatttacggtaaactgcccacttggca 

6 01 gtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatga 
651 cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggact 

7 01 ttcctacttggcagtacatctacgtattagtcatcgctattaccatggtg 

7 51 atgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacg 

8 01 gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggc 
851 accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattg 

9 01 acgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagc 

EcoRI 

9 51 tgggtacgtgaaccgtcagatcgcctggagacgccatc gaattc tgagca 

10 01 cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 
MGWSCI ILFLVA 
Leader start 

XhoI

10 51 acagctacaggtgtccactccgaggtccaactgctcgagtctgggggagg 

T A T G V H S E V Q L L E S 

Processed N-term 
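
The enzyme names printed above the sequence mark restriction sites used for cloning. As a minimal sketch, not taken from the application, the following code locates such sites in a fragment and reports plasmid coordinates. The recognition sequences are the standard ones for these enzymes (EcoRI GAATTC, XhoI CTCGAG, BstEII GGTNACC, Bsp120I GGGCCC); `find_sites` and `ROW_1051` are hypothetical names.

```python
import re

# Recognition sequences as regular expressions (N in BstEII is any base).
SITES = {
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
    "BstEII": "GGT[ACGT]ACC",
    "Bsp120I": "GGGCCC",
}


def find_sites(fragment: str, first_position: int = 1) -> dict:
    """Map enzyme name to 1-based start positions of its sites in the fragment.

    first_position is the plasmid coordinate of the fragment's first base.
    Enzymes with no site in the fragment are omitted from the result.
    """
    seq = fragment.upper().replace(" ", "")
    hits = {}
    for name, pattern in SITES.items():
        positions = [m.start() + first_position for m in re.finditer(pattern, seq)]
        if positions:
            hits[name] = positions
    return hits


# Row beginning at nucleotide 1051 of Figure 8A.
ROW_1051 = "acagctacaggtgtccactccgaggtccaactgctcgagtctgggggagg"

if __name__ == "__main__":
    print(find_sites(ROW_1051, first_position=1051))
```

Searching row by row misses sites that straddle two rows, so a full check would run on the concatenated plasmid sequence.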
FIGURE 8B

1101 cttggtacagcctggggggtccctgagactctcctgcgcagcctctggag 

1151 tctccctcagtggatacaagatgaactgggtccgccaggctccagggaag 

12 01 gggctggaatgggtctcttccattactggtatgagtaattacatacacta 

12 51 ctcagactcagtgaagggccgattcaccatctccagagacaacgccatga 

13 01 actcactgtatctgcaaatgaacagcctgacagccgaggacacgggtgtt 

13 51 tattattgtgcgacacaaccgggggagctggcgccttttgaccattgggg 

BstEII Bsp120I

1401 ccagggaaccct ggtcacc gtctcctcagcctccaccaa gggccc atcgg

QGTLVTVSS/

framework IV / CH1

1451 tcttccccctggcaccctcctccaagagcacctctgggggcacagcggcc

1501 ctgggctgcctggtcaaggactacttccccgaaccggtgacggtgtcgtg

1551 gaactcaggcgccctgaccagcggcgtgcacaccttcccggctgtcctac

BstEII

1601 agtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagc

1651 agcttgggcacccagacctacatctgcaacgtgaatcacaagcccagcaa 

17 01 caccaaggtggacaagaaagttgagcccaaatcttgtgacaaaactcaca 

17 51 catgcccaccgtgcccagcacctgaactcctggggggaccgtcagtcttc 

1801 ctcttccccccaaaacccaaggacaccctcatgatctcccggacccctga 

1851 ggtcacatgcgtggtggtggacgtgagccacgaagaccctgaggtcaagt 

19 01 tcaactggtacgtggacggcgtggaggtgcataatgccaagacaaagccg 

1951 cgggaggagcagtacaacagcacgtaccgggtggtcagcgtcctcaccgt 

2 001 cctgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctcca 

2 051 acaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggg 

2101 cagccccgagaaccacaggtgtacaccctgcccccatcccgggatgagct 

2151 gaccaagaaccaggtcagcctgacctgcctggtcaaaggcttctatccca 






FIGURE 8C 

22 01 gcgacatcgccgtggagtgggagagcaatgggcagccggagaacaactac 

22 51 aagaccacgcctcccgtgctggactccgacggctccttctccctctacag 

2301 caagctcaccgtggacaagagcaggtggcagcaggggaacgtcttctcat

2351 gctccgtgatgcatgaggctctgcacaaccactacacgcagaagagcctc

2401 tccctgtctccgggtaaatgatagatatctacgtatgatcagcctcgact
S P G K * C-term of heavy chain

2451 gtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttc

2501 cttgaccctggaaggtgccactcccactgtcctttcctaataaaatgagg

2551 aaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggg

2 6 01 gtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgc 

2 651 tggggatgcggtgggctctatggaaccagctggggctcgacagcgctgga 

27 01 tctcccgatccccagctttgcttctcaatttcttatttgcataatgagaa 

27 51 aaaaaggaaaattaattttaacaccaattcagtagttgattgagcaaatg 

2801 cgttgccaaaaaggatgctttagagacagtgttctctgcacagataagga 

2 851 caaacattattcagagggagtacccagagctgagactcctaagccagtga 

2901 gtggcacagcattctagggagaaatatgcttgtcatcaccgaagcctgat 

2 9 51 tccgtagagccacaccttggtaagggccaatctgctcacacaggatagag 

3 001 agggcaggagccagggcagagcatataaggtgaggtaggatcagttgctc 
3 051 ctcacatttgcttctgacatagttgtgttgggagcttggatagcttggac 
3101 agctcagggctgcgatttcgcgccaaac ttgacggcaatcctagcgtgaa 
3151 ggctggtaggatt ttatccccgctgccatcatggttcgaccattgaactg 
32 01 catcgtcgccgtgtcccaaaatatggggattggcaagaacggagacctac 

32 51 cctggcctccgctcaggaacgagttcaagtacttccaaagaatgaccaca 
3 3 01 acctcttcagtggaaggtaaacagaatctggtgattatgggtaggaaaac 

33 51 ctggttc tccattcctgagaagaatcgacctttaaaggacagaattaata 
FIGURE 8D

3 401 tagttctcagtagagaactcaaagaaccaccacgaggagc tcattttctt 

3 451 gccaaaagtttggatgatgccttaagacttattgaacaaccggaattggc 

3 501 aagtaaagtagacatggtttggatagtcggaggcagttc tgtttaccagg 

3 551 aagccatgaatcaaccaggccacc ttagac tct ttgtgacaaggatcatg 

3 601 caggaatttgaaagtgacacgtttttcccagaaattgatttggggaaata 

3 651 taaacttctcccagaatacccaggcgtcctctctgaggtccaggaggaaa 

3 7 01 aaggcatcaagtataagtttgaagtctacgagaagaaagactaacaggaa 

3751 gatgctttcaagttctctgctcccctcctaaagctatgcatttttataag

3 8 01 accatgggacttttgctggctttagatcagcctcgactgtgccttctagt 

3 851 tgccagccatctgttgtttgcccctcccccgtgccttccttgaccctgga 

3 901 aggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgc 

3 951 attgtc tgagtaggtgtcattctattctggggggtggggtggggcaggac 

4 0 01 agcaagggggaggat tgggaagacaatagcaggcatgc tggggatgcggt 
4 051 gggctctatggaaccagctggggctcgatcgagtgtatgactgcggccgc 
4101 gatcccgtcgagagcttggcgtaatcatggtcatagctgtttcctgtgtg 
4151 aaattgttatccgctcacaattccacacaacatacgagccggaagcataa 

42 01 agtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcg 
4251 ttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgca 

43 01 ttaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgct 
4351 cttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcgg
4401 cgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatc 
4451 aggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggcca 
4501 ggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccc 
4551 cctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaaccc 
4601 gacaggactataaagataccaggcgt t tccccc tggaagc tccc tcgtgc 
FIGURE 8E

4651 gctc tcctgttccgaccc tgccgc ttaccggatacc tgtccgcc tttc tc 

47 01 ccttcgggaagcgtggcgctttctcaatgctcacgctgtaggtatctcag 

47 51 ttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccg 

48 01 ttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaac 
4851 ccggtaagacacgac t ta tcgccac tggcagcagccac tggtaacaggat 
4901 tagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggc 
4951 ctaactacggctacactagaaggacagtatttggtatctgcgc tctgctg 
5 001 aagccagttaccttcggaaaaagagttggtagctcttgatccggcaaaca 
5051 aaccaccgctggtagcggtggtt t t tt tgtt tgcaagcagcagat tacgc 
5101 gcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtct 
5151 gacgctcagtggaacgaaaactcacgttaagggattttggtcatgagatt 

52 01 atcaaaaaggatcttcacctagatccttttaaattaaaaatgaagtttta 
5251 aatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgc 

53 01 ttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccat 
53 51 agttgcctgactccccgtcgtgtagataac tacgatacgggagggc ttac 
5401 catctggccccagtgctgcaatgataccgcgagacccacgctcaccggct 
5451 ccagatttatcagcaataaaccagccagccggaagggccgagcgcagaag 
5501 tggtcctgcaactttatccgcctccatccagtctattaattgttgccggg 
5551 aagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgcc 

5601 attgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcatt
5651 cagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgt 

57 01 gcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaag 

57 51 ttggccgcagtgttatcactcatggttatggcagcactgcataattctc t 

58 01 tactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaa 
FIGURE 8F

5851 ccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccg 

5901 gcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgct 

5951 catcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgc 

60 01 tgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttca 

6051 gcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggca 

6101 aaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactca 

6151 tactcttcctttttcaatattattgaagcatttatcagggttattgtctc 

62 01 atgagcggatacatatt tgaatgtat ttagaaaaataaacaaatagggg t 

62 51 tccgcgcacatttccccgaaaagtgccacct 
FIGURE 9A

1 gacgtcgcggccgctctaggcctccaaaaaagcctcctcaccacttctgg 

51 aatagctcagaggccgaggcggcctcggcctctgcataaataaaaaaaat 

101 tagtcagccatgcatggggcggagaatgggcggaactgggcggagttagg 

151 ggcgggatgggcggagttaggggcgggactatggttgctgactaattgag 

2 01 atgcatgctttgcatacttctgcctgctggggagcctggggactttccac 

2 51 acctggttgctgactaattgagatgcatgctttgcatacttctgcctgct 

3 01 ggggagcc tggggac tttccacaccctaactgacacacattccacagaat 
3 51 taattcccggggatcgatccgtcgacgtacgactagttattaatagtaat 
401 caattacggggtcattagttcatagcccatatatggagttccgcgttaca 
451 taacttacggtaaatggcccgcctggctgaccgcccaacgacccccgccc 
501 attgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt 
551 tccattgacgtcaatgggtggactatttacggtaaactgcccacttggca 
601 gtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatga 
651 cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggact 
7 01 ttcctacttggcagtacatctacgtattagtcatcgctattaccatggtg 
751 atgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacg 
801 gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggc 
851 accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattg 
9 01 acgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagc 

EcoRI 

951 tgggtacgtgaaccgtcagatcgcctggagacgccatc gaattc tgagca 

1001 cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca
MGWSCIILFLVA
Leader start

SacI

1051 acagctacaggtgtccactccgagctcacgcagccgccctcagtctctgc 
T A T G V H S E L T Q -- 

Processed N-term 
FIGURE 9B

1101 ggccccaggacagaaggtcaccatctcctgcactgggagcagctccaacc 
1151 tcggggcaggttatgatgttcac tggtaccggcaac ttccagggacagcc 
12 01 cccaaactcctcatctatgataacaacaatcggccctcaggggtccctga 

12 51 ccgattctctggctccaagtctggcccctcagcctccctggccatctctg 

13 01 ggctccaggctgaggatgaggctgattattactgccagtcctatgacagc 

AvrII

13 51 agcctgaatggttatgtcttcggaactgggacccagctcaccgtcctagg 

T Q L T V L G 
Framework IV / Cλ

14 01 tcagcccaaggctgccccctcggtcactctgttcccgccctcctctgagg 
1451 agcttcaagccaacaaggccacactggtgtgtctcataagtgacttctac 

15 01 ccgggagccgtgacagtggcc tggaaggcaat tagcagccccgtcaaggc 
1551 gggagtggagaccaccacaccctccaaacaaagcaacaacaagtacgcgg 

16 01 ccagcagctatctgagcctgacgcctgagcagtggaagtcccacagaagg 
1651 tacagctgccaggtcacgcatgaagggagcaccgtggagaagacagtggc 

17 01 ccctacagaatgttcatagttctagatctacgtatgatcagcctcgactg 

P T E C S * C-term light chain 

17 51 tgccttctagttgccagccatctgttgtttgcccctcccccgtgccttcc 

1801 ttgaccctggaaggtgccactcccactgtcctttcctaataaaatgagga 

1851 aattgcatcgcattgtctgagtaggtgtcattctattctggggggtgggg 

19 01 tggggcaggacagcaagggggaggattgggaagacaatagcaggcatgct 

1951 ggggatgcggtgggctc tatggaaccagc tggggctcgacagc tcgagc t 

2 001 agctttgcttctcaatttcttatttgcataatgagaaaaaaaggaaaatt 

2 051 aattttaacaccaattcagtagttgattgagcaaatgcgttgccaaaaag 

2101 gatgctttagagacagtgttctctgcacagataaggacaaacattattca 

2151 gagggagtacccagagctgagactcctaagccagtgagtggcacagcatt 






FIGURE 9C 

22 01 ctagggagaaatatgcttgtcatcaccgaagcctgattccgtagagccac 

22 51 accttggtaagggccaatctgctcacacaggatagagagggcaggagcca 

23 01 gggcagagcatataaggtgaggtaggatcagttgctcctcacatttgct t 
2 3 51 ctgacatagttgtgttgggagcttggatcgatccaccatggttgaacaag 

24 01 atggattgcacgcaggttctccggccgcttgggtggagaggctattcggc 
2451 tatgactgggcacaacagacaatcggctgctctgatgccgccgtgttccg 

25 01 gctgtcagcgcaggggcgcccggttctttttgtcaagaccgacctgtccg 
2 5 51 gtgccc tgaatgaactgcaggacgaggcagcgcggctatcgtggctggcc 
2 601 acgacgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcggg 
2 6 51 aagggactggctgctattgggcgaagtgccggggcaggatctcctgtcat 
27 01 ctcaccttgctcctgccgagaaagtatccatcatggctgatgcaatgcgg 
27 51 cggctgcatacgcttgatccggctacctgcccat tcgaccaccaagcgaa 
2 8 01 acatcgcatcgagcgagcacgtactcggatggaagccggtcttgtcgatc 

2 8 51 aggatgatc tggacgaagagcatcaggggctcgcgccagccgaac tgttc 
29 01 gccaggctcaaggcgcgcatgcccgacggcgaggatctcgtcgtgaccca 
2951 tggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttc tg 

3 0 01 gattcatcgactgtggccggctgggtgtggcggaccgctatcaggacata 
3 051 gcgttggctacccgtgatattgctgaagagc ttggcggcgaatgggctga 
3101 ccgcttcctcgtgctttacggtatcgccgctcccgattcgcagcgcatcg 
3151 ccttctatcgccttcttgacgagttcttctgagcgggactctggggttcg 
32 01 aaatgaccgaccaagcgacgcccaacctgccatcacgagatttcgattcc 

32 51 accgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgc 

33 01 cggctggatgatcctccagcgcggggatctcatgctggagttcttcgccc 
33 51 accccaacttgtttattgcagcttataatggttacaaataaagcaatagc 
FIGURE 9D

34 01 atcacaaat ttcacaaataaagcatttttttcactgcattc tagttgtgg 

3 451 tttgtccaaactcatcaatgtatcttatcatgtctggatcgcggccgcga 

3 501 tcccgtcgagagcttggcgtaatcatggtcatagc tgt ttcc tgtgtgaa 

3 551 attgttatccgctcacaattccacacaacatacgagccggaagcataaag 

3 601 tgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcgtt 

3 651 gcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcatt 

37 01 aatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctct 

3 751 tccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcg 

3 801 agcggtatcagctcactcaaaggcggtaatacggt tatccacagaatcag 

3 851 gggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccagg 

3901 aaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccc 

3 951 tgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccga 
40 01 caggactataaagataccaggcgtttccccctggaagctccctcgtgcgc 
4051 tctcctgttccgaccctgccgcttaccggatacctgtccgcctttctccc 
4101 ttcgggaagcgtggcgctttctcaatgctcacgctgtaggtatctcagtt 
4151 cggtgtaggtcgttcgc tccaagctgggctgtgtgcacgaaccccccgt t 
42 01 cagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaaccc 

42 51 ggtaagacacgacttatcgccactggcagcagccactggtaacaggatta 

43 01 gcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcct 
4351 aactacggctacactagaaggacagtatttggtatctgcgctctgctgaa 
4401 gccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaa 
4451 ccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgc 

4 501 agaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctga 
4551 cgctcagtggaacgaaaactcacgttaagggattttggtcatgagattat 
FIGURE 9E

46 01 caaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaa 

4 651 tcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgctt 

47 01 aatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatag 
47 51 ttgcctgactccccgtcgtgtagataactacgatacgggagggcttacca 
4801 tctggccccagtgctgcaatgataccgcgagacccacgctcaccggctcc 
4851 agatttatcagcaataaaccagccagccggaagggccgagcgcagaagtg 

49 01 gtcctgcaactttatccgcctccatccagtctattaattgttgccgggaa 
4951 gctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccat 

50 01 tgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattca 
5051 gctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgc 
5101 aaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagtt 
5151 ggccgcagtgttatcactcatggttatggcagcactgcataattctctta 
52 01 ctgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaacc 

52 51 aagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggc 

53 01 gtcaatacgggataataccgcgccacatagcagaactttaaaagtgctca 
53 51 tcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctg 
5401 ttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagc
5451 atcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaa 
5501 atgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcata 
5551 ctcttcctttttcaatattattgaagcatttatcagggttattgtctcat 

5 601 gagcggatacatatttgaatgtatttagaaaaataaacaaataggggttc 
5651 cgcgcacatttccccgaaaagtgccacct 
FIGURE 10A

EcoRI 

gaattctgagca 1000 

cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 10 5 0 
MGWSCIILFLVA 

acagctacaggtgtccactccgaggtgcagctggtggagtctgggggagg 110 0 
T A T G V H S E V Q L V E S - 
N- term 

cttggtacagcctggggggtccctgagactctcctgcgcagcctctggag 115 0 

tctccctcagtggatacaagatgaactgggtccgccaggctccagggaag 12 0 0 

gggctggaatgggtctcttccattactggtatgagtaattacatacacta 12 50 

ctcagactcagtgaagggccgattcaccatctccagagacaacgccatga 13 00 

actcactgtatctgcaaatgaacagcctgacagccgaggacacgggtgtt 13 50 

tattattgtgcgacacaaccgggggagctggcgccttttgaccattgggg 14 0 0 

Bsp120I

ccagggaaccctggtcaccgtctcctcagcctccaccaa gggccc atcgg 1450 

tcttccccctggcaccctcctccaagagcacctctgggggcacagcggcc 15 0 0 

ctgggctgcctggtcaaggactacttccccgaaccggtgacggtgtcgtg 155 0 

gaactcaggcgccctgaccagcggcgtgcacaccttcccggctgtcctac 1600 

agtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagc 165 0 

agcttgggcacccagacctacatctgcaacgtgaatcacaagcccagcaa 17 0 0 

caccaaggtggacaagaaagttgagcccaaatcttgtgacaaaactcaca 175 0 

catgcccaccgtgcccagcacctgaactcctggggggaccgtcagtcttc 18 0 0 

ctcttccccccaaaacccaaggacaccctcatgatctcccggacccctga 1850 

ggtcacatgcgtggtggtggacgtgagccacgaagaccctgaggtcaagt 19 0 0 

tcaactggtacgtggacggcgtggaggtgcataatgccaagacaaagccg 195 0 

cgggaggagcagtacaacagcacgtaccgggtggtcagcgtcctcaccgt 2 0 00 

cc tgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctcca 2 05 0 
FIGURE 10B

acaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggg 210 0 

cagccccgagaaccacaggtgtacaccctgcccccatcccgggatgagct 215 0 

gaccaagaaccaggtcagcctgacctgcctggtcaaaggcttctatccca 22 0 0 

gcgacatcgccgtggagtgggagagcaatgggcagccggagaacaactac 225 0 

aagaccacgcctcccgtgctggactccgacggctccttcttcctctacag 23 0 0 

caagctcaccgtggacaagagcaggtggcagcaggggaacgtcttctcat 23 5 0 

gctccgtgatgcatgaggctctgcacaaccactacacgcagaagagcctc 240 0 
tccctgtctccgggtaaa tga tagatatct 
FIGURE 11

EcoRI 

gaattc tgagca 1000 

cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 10 5 0 
MGWSCIILFLVA 

acagctacaggtgtccactcc cagtct gtgttgacgcagccgccctcagt 110 0 
T A T G V H S Q S V L T Q - 
N-term 

ctctgcggccccaggacagaaggtcaccatctcctgcactgggagcagct 115 0 

ccaacctcggggcaggttatgatgttcactggtaccggcaacttccaggg 12 0 0 

acagcccccaaactcctcatctatgataacaacaatcggccctcaggggt 12 5 0 

ccctgaccgattctctggctccaagtctggcccc tcagcc tccctggcca 13 0 0 

tctctgggctccaggctgaggatgaggctgattattactgccagtcctat 13 5 0 

gacagcagcctgaatggttatgtcttcggaactgggacccagctcaccgt 14 0 0 
AvrII

cctaggtcagcccaaggctgccccctcggtcactctgttcccgccctcct 145 0 

ctgaggagcttcaagccaacaaggccacactggtgtgtctcataagtgac 1500 

ttctacccgggagccgtgacagtggcctggaaggcaattagcagccccgt 155 0 

caaggcgggagtggagaccaccacaccctccaaacaaagcaacaacaagt 160 0 

acgcggccagcagctatctgagcctgacgcctgagcagtggaagtcccac 165 0 

agaaggtacagctgccaggtcacgcatgaagggagcaccgtggagaagac 17 0 0 

agtggcccctacagaatgttcatagttctagatctacgtatgatcagcct 17 5 0 
P T E C S * 
(1) GENERAL INFORMATION:

(i) APPLICANT: SmithKline Beecham, PLC
(ii) TITLE OF INVENTION: Human Monoclonal Antibody
(iii) NUMBER OF SEQUENCES: 21

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SmithKline Beecham Corporation
(B) STREET: 709 Swedeland Road
(C) CITY: King of Prussia
(D) STATE: PA
(E) COUNTRY: USA
(F) ZIP: 19406-2799

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: GB
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: King, William T.
(B) REGISTRATION NUMBER: 30,954
(C) REFERENCE/DOCKET NUMBER: #

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 610-270-4800
(B) TELEFAX: 610-270-4026



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 336 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..336

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CAG TCT GTG TTG ACG CAG CCG CCC TCA GTC TCT GCG GCC CCA GGA CAG 48
Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15

AAG GTC ACC ATC TCC TGC ACT GGG AGC AGC TCC AAC CTC GGG GCA GGT 96
Lys Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly
20 25 30

TAT GAT GTT CAC TGG TAC CGG CAA CTT CCA GGG ACA GCC CCC AAA CTC 144
Tyr Asp Val His Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

CTC ATC TAT GAT AAC AAC AAT CGG CCC TCA GGG GTC CCT GAC CGA TTC 192
Leu Ile Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

TCT GGC TCC AAG TCT GGC CCC TCA GCC TCC CTG GCC ATC TCT GGG CTC 240
Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala Ile Ser Gly Leu
65 70 75 80

CAG GCT GAG GAT GAG GCT GAT TAT TAC TGC CAG TCC TAT GAC AGC AGC 288
Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser
85 90 95

CTG AAT GGT TAT GTC TTC GGA ACT GGG ACC CAG CTC ACC GTC CTA GGT 336
Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu Gly
100 105 110

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 112 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15

Lys Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly
20 25 30

Tyr Asp Val His Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

Leu Ile Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala Ile Ser Gly Leu
65 70 75 80

Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser
85 90 95

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu Gly
100 105 110
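
The sequence listing uses three-letter residue codes, whereas the figures use one-letter codes. The sketch below, offered only as an illustration, converts a listing row written with the standard three-letter codes into one-letter form so the two presentations can be compared; `THREE_TO_ONE`, `to_one_letter` and `SEQ2_ROW1` are hypothetical names introduced here.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(row: str) -> str:
    """Convert a whitespace-separated row of three-letter codes.

    Raises KeyError on an unrecognised token, which also flags
    transcription errors in the listing.
    """
    return "".join(THREE_TO_ONE[token] for token in row.split())


# Residues 1-16 of SEQ ID NO: 2.
SEQ2_ROW1 = "Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln"

if __name__ == "__main__":
    print(to_one_letter(SEQ2_ROW1))
```

The strict lookup is deliberate: a misspelled code stops the conversion instead of silently producing a wrong residue.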



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 357 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..357

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GAG GTG CAG CTG GTG GAG TCT GGG GGA GGC TTG GTA CAG CCT GGG GGG 48
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

TCC CTG AGA CTC TCC TGC GCA GCC TCT GGA GTC TCC CTC AGT GGA TAC 96
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr
20 25 30

AAG ATG AAC TGG GTC CGC CAG GCT CCA GGG AAG GGG CTG GAA TGG GTC 144
Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

TCT TCC ATT ACT GGT ATG AGT AAT TAC ATA CAC TAC TCA GAC TCA GTG 192
Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser Asp Ser Val
50 55 60

AAG GGC CGA TTC ACC ATC TCC AGA GAC AAC GCC ATG AAC TCA CTG TAT 240
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr
65 70 75 80

CTG CAA ATG AAC AGC CTG ACA GCC GAG GAC ACG GGT GTT TAT TAT TGT 288
Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys
85 90 95

GCG ACA CAA CCG GGG GAG CTG GCG CCT TTT GAC CAT TGG GGC CAG GGA 336
Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gln Gly
100 105 110

ACC CTG GTC ACC GTC TCC TCA 357
Thr Leu Val Thr Val Ser Ser
115



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr
20 25 30

Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys
85 90 95

Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gln Gly
100 105 110

Thr Leu Val Thr Val Ser Ser
115



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr
20 25 30

Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys
85 90 95

Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gln Gly
100 105 110

Thr Leu Val Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser Tyr
20 25 30

Glu Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ser Tyr Ile Ser Ser Ser Gly Ser Thr Ile Tyr Tyr Ala Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Glu Val Gln Leu Leu Glu Ser Gly Gly Gly Leu Val Gln
20 25 30

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu
35 40 45

Ser Gly Tyr Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
50 55 60

Glu Trp Val Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser
65 70 75 80

Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn
85 90 95

Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val
100 105 110

Tyr Tyr Cys Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp
115 120 125

Gly Gln Gly Thr Leu Val Thr Val Ser Ser
130 135

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln
20 25 30

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu
35 40 45

Ser Gly Tyr Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
50 55 60

Glu Trp Val Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser
65 70 75 80

Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn
85 90 95

Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val
100 105 110

Tyr Tyr Cys Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp
115 120 125

Gly Gln Gly Thr Leu Val Thr Val Ser Ser
130 135

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15

Lys Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly
20 25 30

Tyr Asp Val His Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

Leu Ile Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala Ile Ser Gly Leu
65 70 75 80

Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser
85 90 95

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu
100 105 110

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Gly Ala Pro Gly Gln
1 5 10 15

Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile Gly Ala Gly
20 25 30

Tyr Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Thr Gly Leu
65 70 75 80

Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys
85 90

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 128 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Glu Leu Thr Gln Pro Pro Ser Val Ser Gly Ala Pro Gly
20 25 30

Gln Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile Gly Ala
35 40 45

Gly Tyr Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys
50 55 60

Leu Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg
65 70 75 80

Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Thr Gly
85 90 95

Leu Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser
100 105 110

Ser Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu
115 120 125



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Gly Ala
20 25 30

Pro Gly Gln Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile
35 40 45

Gly Ala Gly Tyr Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala
50 55 60

Pro Lys Leu Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro
65 70 75 80

Asp Arg Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
85 90 95

Thr Gly Leu Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr
100 105 110

Asp Ser Ser Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr
115 120 125

Val Leu
130

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 360

GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 420

CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480

CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 540

ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 600

GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660

CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 720

TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780

GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840

TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900

ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 960

AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 1020

TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGGTCCAA 1080

CTGCTCGAGT CTGGGGGAGG CTTGGTACAG CCTGGGGGGT CCCTGAGACT CTCCTGCGCA 1140

GCCTCTGGAG TCTCCCTCAG TGGATACAAG ATGAACTGGG TCCGCCAGGC TCCAGGGAAG 1200

GGGCTGGAAT GGGTCTCTTC CATTACTGGT ATGAGTAATT ACATACACTA CTCAGACTCA 1260

GTGAAGGGCC GATTCACCAT CTCCAGAGAC AACGCCATGA ACTCACTGTA TCTGCAAATG 1320

AACAGCCTGA CAGCCGAGGA CACGGGTGTT TATTATTGTG CGACACAACC GGGGGAGCTG 1380

GCGCCTTTTG ACCATTGGGG CCAGGGAACC CTGGTCACCG TCTCCTCAGC CTCCACCAAG 1440

GGCCCATCGG TCTTCCCCCT GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 1500

CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG GAACTCAGGC 1560

GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC AGTCCTCAGG ACTCTACTCC 1620

CTCAGCAGCG TGGTGACCGT GCCCTCCAGC AGCTTGGGCA CCCAGACCTA CATCTGCAAC 1680

GTGAATCACA AGCCCAGCAA CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC 1740

AAAACTCACA CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 1800

CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA GGTCACATGC 1860

GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT TCAACTGGTA CGTGGACGGC 1920

GTGGAGGTGC ATAATGCCAA GACAAAGCCG CGGGAGGAGC AGTACAACAG CACGTACCGG 1980

GTGGTCAGCG TCCTCACCGT CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC 2040

AAGGTCTCCA ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 2100

CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGATGAGCT GACCAAGAAC 2160

CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA GCGACATCGC CGTGGAGTGG 2220

GAGAGCAATG GGCAGCCGGA GAACAACTAC AAGACCACGC CTCCCGTGCT GGACTCCGAC 2280

GGCTCCTTCT TCCTCTACAG CAAGCTCACC GTGGACAAGA GCAGGTGGCA GCAGGGGAAC 2340

GTCTTCTCAT GCTCCGTGAT GCATGAGGCT CTGCACAACC ACTACACGCA GAAGAGCCTC 2400

TCCCTGTCTC CGGGTAAATG ATAGATATCT ACGTATGATC AGCCTCGACT GTGCCTTCTA 2460

GTTGCCAGCC ATCTGTTGTT TGCCCCTCCC CCGTGCCTTC CTTGACCCTG GAAGGTGCCA 2520

CTCCCACTGT CCTTTCCTAA TAAAATGAGG AAATTGCATC GCATTGTCTG AGTAGGTGTC 2580

ATTCTATTCT GGGGGGTGGG GTGGGGCAGG ACAGCAAGGG GGAGGATTGG GAAGACAATA 2640

GCAGGCATGC TGGGGATGCG GTGGGCTCTA TGGAACCAGC TGGGGCTCGA CAGCGCTGGA 2700

TCTCCCGATC CCCAGCTTTG CTTCTCAATT TCTTATTTGC ATAATGAGAA AAAAAGGAAA 2760

ATTAATTTTA ACACCAATTC AGTAGTTGAT TGAGCAAATG CGTTGCCAAA AAGGATGCTT 2820

TAGAGACAGT GTTCTCTGCA CAGATAAGGA CAAACATTAT TCAGAGGGAG TACCCAGAGC 2880

TGAGACTCCT AAGCCAGTGA GTGGCACAGC ATTCTAGGGA GAAATATGCT TGTCATCACC 2940

GAAGCCTGAT TCCGTAGAGC CACACCTTGG TAAGGGCCAA TCTGCTCACA CAGGATAGAG 3000

AGGGCAGGAG CCAGGGCAGA GCATATAAGG TGAGGTAGGA TCAGTTGCTC CTCACATTTG 3060

CTTCTGACAT AGTTGTGTTG GGAGCTTGGA TAGCTTGGAC AGCTCAGGGC TGCGATTTCG 3120

CGCCAAACTT GACGGCAATC CTAGCGTGAA GGCTGGTAGG ATTTTATCCC CGCTGCCATC 3180

ATGGTTCGAC CATTGAACTG CATCGTCGCC GTGTCCCAAA ATATGGGGAT TGGCAAGAAC 3240

GGAGACCTAC CCTGGCCTCC GCTCAGGAAC GAGTTCAAGT ACTTCCAAAG AATGACCACA 3300

ACCTCTTCAG TGGAAGGTAA ACAGAATCTG GTGATTATGG GTAGGAAAAC CTGGTTCTCC 3360

ATTCCTGAGA AGAATCGACC TTTAAAGGAC AGAATTAATA TAGTTCTCAG TAGAGAACTC 3420

AAAGAACCAC CACGAGGAGC TCATTTTCTT GCCAAAAGTT TGGATGATGC CTTAAGACTT 3480

ATTGAACAAC CGGAATTGGC AAGTAAAGTA GACATGGTTT GGATAGTCGG AGGCAGTTCT 3540

GTTTACCAGG AAGCCATGAA TCAACCAGGC CACCTTAGAC TCTTTGTGAC AAGGATCATG 3600

CAGGAATTTG AAAGTGACAC GTTTTTCCCA GAAATTGATT TGGGGAAATA TAAACTTCTC 3660

CCAGAATACC CAGGCGTCCT CTCTGAGGTC CAGGAGGAAA AAGGCATCAA GTATAAGTTT 3720

GAAGTCTACG AGAAGAAAGA CTAACAGGAA GATGCTTTCA AGTTCTCTGC TCCCCTCCTA 3780

AAGCTATGCA TTTTTATAAG ACCATGGGAC TTTTGCTGGC TTTAGATCAG CCTCGACTGT 3840

GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT TGACCCTGGA 3900

AGGTGCCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA ATTGCATCGC ATTGTCTGAG 3960

TAGGTGTCAT TCTATTCTGG GGGGTGGGGT GGGGCAGGAC AGCAAGGGGG AGGATTGGGA 4020

AGACAATAGC AGGCATGCTG GGGATGCGGT GGGCTCTATG GAACCAGCTG GGGCTCGATC 4080

GAGTGTATGA CTGCGGCCGC GATCCCGTCG AGAGCTTGGC GTAATCATGG TCATAGCTGT 4140

TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC GGAAGCATAA 4200

AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC ATTAATTGCG TTGCGCTCAC 4260

TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA TTAATGAATC GGCCAACGCG 4320

CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 4380

GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 4440

CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 4500

GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 4560

ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC 4620

AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 4680

GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCAATGC TCACGCTGTA 4740

GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 4800

TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 4860

ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG 4920

GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 4980

TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 5040

CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 5100

GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 5160

GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG ATCTTCACCT 5220

AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 5280

GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 5340

GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC TACGATACGG GAGGGCTTAC 5400

CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT 5460

CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 5520

CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA 5580

GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 5640

TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCGATGTTGT 5700

GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 5760

TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCCGTAA 5820

GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 5880

GACCGAGTTG CTCTTGCCCG GCGTCAATAC GGGATAATAC CGCGCCACAT AGCAGAACTT 5940

TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATGTTACCGC 6000

TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 6060

CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 6120

TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 6180

TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 6240

AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCACC T 6281
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 360

GGATCGATGC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 420

CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480

CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 540

ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 600

GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660

CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 720

TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780

GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840

TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900

ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 960

AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 1020

TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGCTCACG 1080

CAGCCGCCCT CAGTCTCTGC GGCCCCAGGA CAGAAGGTCA CCATCTCCTG CACTGGGAGC 1140

AGCTCCAACC TCGGGGCAGG TTATGATGTT CACTGGTACC GGCAACTTCC AGGGACAGCC 1200

CCCAAACTCC TCATCTATGA TAACAACAAT CGGCCCTCAG GGGTCCCTGA CCGATTCTCT 1260

GGCTCCAAGT CTGGCCCCTC AGCCTCCCTG GCCATCTCTG GGCTCCAGGC TGAGGATGAG 1320

GCTGATTATT ACTGCCAGTC CTATGACAGC AGCCTGAATG GTTATGTCTT CGGAACTGGG 1380

ACCCAGCTCA CCGTCCTAGG TCAGCCCAAG GCTGCCCCCT CGGTCACTCT GTTCCCGCCC 1440

TCCTCTGAGG AGCTTCAAGC CAACAAGGCC ACACTGGTGT GTCTCATAAG TGACTTCTAC 1500

CCGGGAGCCG TGACAGTGGC CTGGAAGGCA ATTAGCAGCC CCGTCAAGGC GGGAGTGGAG 1560

ACCACCACAC CCTCCAAACA AAGCAACAAC AAGTACGCGG CCAGCAGCTA TCTGAGCCTG 1620

ACGCCTGAGC AGTGGAAGTC CCACAGAAGG TACAGCTGCC AGGTCACGCA TGAAGGGAGC 1680

ACCGTGGAGA AGACAGTGGC CCCTACAGAA TGTTCATAGT TCTAGATCTA CGTATGATCA 1740

GCCTCGACTG TGCCTTCTAG TTGCCAGCCA TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC 1800

TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG 1860

CATTGTCTGA GTAGGTGTCA TTCTATTCTG GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG 1920



GAGGATTGGG AAGACAATAG CAGGCATGCT GGGGATGCGG TGGGCTCTAT GGAACCAGCT 1980

GGGGCTCGAC AGCTCGAGCT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA 2040

AAGGAAAATT AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG 2100

GATGCTTTAG AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC 2160

CCAGAGCTGA GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT 2220

CATCACCGAA GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG 2280

GATAGAGAGG GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC 2340

ACATTTGCTT CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCACCATG GTTGAACAAG 2400

ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG 2460

CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC 2520

CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG 2580

CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA 2640

CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT 2700

CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA 2760

CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC 2820

GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC 2880

TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG 2940

TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG 3000

GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA 3060

CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG 3120

GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT 3180

GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA 3240

TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC 3300

CGGCTGGATG ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT 3360

GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA 3420

AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA 3480

TGTCTGGATC GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT 3540

CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG 3600

TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 3660

CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG 3720

GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC 3780

TCGGTCGTTC GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC 3840

ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG 3900

AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT 3960

CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG 4020

GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA 4080

TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG 4140

TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT 4200

CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC 4260

GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC 4320

GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT 4380

GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC 4440

GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC 4500

AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG 4560

AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG 4620

ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG 4680

TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT 4740

TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA 4800

TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA 4860

GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC 4920

TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT 4980

TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG 5040

GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC 5100

AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG 5160

TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA 5220

TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA 5280

CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA 5340

AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG 5400

TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT 5460

TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA 5520

AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT 5580

TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA 5640

ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCT 5679

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAATTCTGAG CACACAGGAC CTCACCATGG GATGGAGCTG TATCATCCTC TTCTTGGTAG 60

CAACAGCTAC AGGTGTCCAC TCCGAGGTGC AGCTGGTGGA GTCTGGGGGA GGCTTGGTAC 120

AGCCTGGGGG GTCCCTGAGA CTCTCCTGCG CAGCCTCTGG AGTCTCCCTC AGTGGATACA 180

AGATGAACTG GGTCCGCCAG GCTCCAGGGA AGGGGCTGGA ATGGGTCTCT TCCATTACTG 240

GTATGAGTAA TTACATACAC TACTCAGACT CAGTGAAGGG CCGATTCACC ATCTCCAGAG 300

ACAACGCCAT GAACTCACTG TATCTGCAAA TGAACAGCCT GACAGCCGAG GACACGGGTG 360

TTTATTATTG TGCGACACAA CCGGGGGAGC TGGCGCCTTT TGACCATTGG GGCCAGGGAA 420

CCCTGGTCAC CGTCTCCTCA GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT 480

CCTCCAAGAG CACCTCTGGG GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC 540

CCGAACCGGT GACGGTGTCG TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC 600

CGGCTGTCCT ACAGTCCTCA GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA 660

GCAGCTTGGG CACCCAGACC TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG 720

TGGACAAGAA AGTTGAGCCC AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG 780

CACCTGAACT CCTGGGGGGA CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC 840

TCATGATCTC CCGGACCCCT GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC 900

CTGAGGTCAA GTTCAACTGG TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC 960

CGCGGGAGGA GCAGTACAAC AGCACGTACC GGGTGGTCAG CGTCCTCACC GTCCTGCACC 1020

AGGACTGGCT GAATGGCAAG GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC 1080

CCATCGAGAA AACCATCTCC AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC 1140

TGCCCCCATC CCGGGATGAG CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG 1200

GCTTCTATCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT 1260

ACAAGACCAC GCCTCCCGTG CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA 1320

CCGTGGACAA GAGCAGGTGG CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG 1380

CTCTGCACAA CCACTACACG CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGATAGATAT 1440

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAATTCTGAG CACACAGGAC CTCACCATGG GATGGAGCTG TATCATCCTC TTCTTGGTAG 60

CAACAGCTAC AGGTGTCCAC TCCCAGTCTG TGTTGACGCA GCCGCCCTCA GTCTCTGCGG 120

CCCCAGGACA GAAGGTCACC ATCTCCTGCA CTGGGAGCAG CTCCAACCTC GGGGCAGGTT 180

ATGATGTTCA CTGGTACCGG CAACTTCCAG GGACAGCCCC CAAACTCCTC ATCTATGATA 240

ACAACAATCG GCCCTCAGGG GTCCCTGACC GATTCTCTGG CTCCAAGTCT GGCCCCTCAG 300

CCTCCCTGGC CATCTCTGGG CTCCAGGCTG AGGATGAGGC TGATTATTAC TGCCAGTCCT 360

ATGACAGCAG CCTGAATGGT TATGTCTTCG GAACTGGGAC CCAGCTCACC GTCCTAGGTC 420

AGCCCAAGGC TGCCCCCTCG GTCACTCTGT TCCCGCCCTC CTCTGAGGAG CTTCAAGCCA 480

ACAAGGCCAC ACTGGTGTGT CTCATAAGTG ACTTCTACCC GGGAGCCGTG ACAGTGGCCT 540

GGAAGGCAAT TAGCAGCCCC GTCAAGGCGG GAGTGGAGAC CACCACACCC TCCAAACAAA 600

GCAACAACAA GTACGCGGCC AGCAGCTATC TGAGCCTGAC GCCTGAGCAG TGGAAGTCCC 660

ACAGAAGGTA CAGCTGCCAG GTCACGCATG AAGGGAGCAC CGTGGAGAAG ACAGTGGCCC 720

CTACAGAATG TTCATAGTTC TAGATCTACG TATGATCAGC CT 762
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Val Gln Leu Leu Glu
1 5 

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Glu Val Gln Leu Val Glu
1 5

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1899 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 14..1735



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGGGCAAATA ACA ATG GAG TTG CTA ATC CTC AAA GCA AAT GCA ATT ACC
Met Glu Leu Leu Ile Leu Lys Ala Asn Ala Ile Thr
1 5 10

ACA ATC CTC ACT GCA GTC ACA TTT TGT TTT GCT TCT GGT CAA AAC ATC
Thr Ile Leu Thr Ala Val Thr Phe Cys Phe Ala Ser Gly Gln Asn Ile
15 20 25

ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTT AGC AAA GGC TAT
Thr Glu Glu Phe Tyr Gln Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr
30 35 40

CTT AGT GCT CTG AGA ACT GGT TGG TAT ACC AGT GTT ATA ACT ATA GAA
Leu Ser Ala Leu Arg Thr Gly Trp Tyr Thr Ser Val Ile Thr Ile Glu
45 50 55 60

TTA AGT AAT ATC AAG GAA AAT AAG TGT AAT GGA ACA GAT GCT AAG GTA
Leu Ser Asn Ile Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val
65 70 75

AAA TTG ATA AAA CAA GAA TTA GAT AAA TAT AAA AAT GCT GTA ACA GAA
Lys Leu Ile Lys Gln Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu
80 85 90

TTG CAG TTG CTC ATG CAA AGC ACA CCA CCA ACA AAC AAT CGA GCC AGA
Leu Gln Leu Leu Met Gln Ser Thr Pro Pro Thr Asn Asn Arg Ala Arg
95 100 105

AGA GAA CTA CCA AGG TTT ATG AAT TAT ACA CTC AAC AAT GCC AAA AAA 385
Arg Glu Leu Pro Arg Phe Met Asn Tyr Thr Leu Asn Asn Ala Lys Lys
110 115 120

ACC AAT GTA ACA TTA AGC AAG AAA AGG AAA AGA AGA TTT CTT GGT TTT 433
Thr Asn Val Thr Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe
125 130 135 140

TTG TTA GGT GTT GGA TCT GCA ATC GCC AGT GGC GTT GCT GTA TCT AAG 481
Leu Leu Gly Val Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys
145 150 155

GTC CTG CAC CTA GAA GGG GAA GTG AAC AAG ATC AAA AGT GCT CTA CTA 529
Val Leu His Leu Glu Gly Glu Val Asn Lys Ile Lys Ser Ala Leu Leu
160 165 170

TCC ACA AAC AAG GCT GTA GTC AGC TTA TCA AAT GGA GTT AGT GTC TTA 577
Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Val Leu
175 180 185

ACC AGC AAA GTG TTA GAC CTC AAA AAC TAT ATA GAT AAA CAA TTG TTA 625
Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Gln Leu Leu
190 195 200

CCT ATT GTG AAC AAG CAA AGC TGC AGC ATA TCA AAT ATA GAA ACT GTG 673
Pro Ile Val Asn Lys Gln Ser Cys Ser Ile Ser Asn Ile Glu Thr Val
205 210 215 220

ATA GAG TTC CAA CAA AAG AAC AAC AGA CTA CTA GAG ATT ACC AGG GAA 721
Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Thr Arg Glu
225 230 235

TTT AGT GTT AAT GCA GGT GTA ACT ACA CCT GTA AGC ACT TAC ATG TTA 769
Phe Ser Val Asn Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu
240 245 250

ACT AAT AGT GAA TTA TTG TCA TTA ATC AAT GAT ATG CCT ATA ACA AAT 817
Thr Asn Ser Glu Leu Leu Ser Leu Ile Asn Asp Met Pro Ile Thr Asn
255 260 265

GAT CAG AAA AAG TTA ATG TCC AAC AAT GTT CAA ATA GTT AGA CAG CAA 865
Asp Gln Lys Lys Leu Met Ser Asn Asn Val Gln Ile Val Arg Gln Gln
270 275 280

AGT TAC TCT ATC ATG TCC ATA ATA AAA GAG GAA GTC TTA GCA TAT GTA 913
Ser Tyr Ser Ile Met Ser Ile Ile Lys Glu Glu Val Leu Ala Tyr Val
285 290 295 300

GTA CAA TTA CCA CTA TAT GGT GTT ATA GAT ACA CCC TGT TGG AAA CTA 961
Val Gln Leu Pro Leu Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu
305 310 315

CAC ACA TCC CCT CTA TGT ACA ACC AAC ACA AAA GAA GGG TCC AAC ATC 1009
His Thr Ser Pro Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn Ile
320 325 330

TGT TTA ACA AGA ACT GAC AGA GGA TGG TAC TGT GAC AAT GCA GGA TCA 1057
Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser
335 340 345



GTA TCT TTC TTC CCA CAA GCT GAA ACA TGT AAA GTT CAA TCA AAT CGA
Val Ser Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn Arg
350 355 360

GTA TTT TGT GAC ACA ATG AAC AGT TTA ACA TTA CCA AGT GAA ATA AAT
Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Ser Glu Ile Asn
365 370 375 380

CTC TGC AAT GTT GAC ATA TTC AAC CCC AAA TAT GAT TGT AAA ATT ATG
Leu Cys Asn Val Asp Ile Phe Asn Pro Lys Tyr Asp Cys Lys Ile Met
385 390 395

ACT TCA AAA ACA GAT GTA AGC AGC TCC GTT ATC ACA TCT CTA GGA GCC
Thr Ser Lys Thr Asp Val Ser Ser Ser Val Ile Thr Ser Leu Gly Ala
400 405 410

ATT GTG TCA TGC TAT GGC AAA ACT AAA TGT ACA GCA TCC AAT AAA AAT
Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn
415 420 425

CGT GGA ATC ATA AAG ACA TTT TCT AAC GGG TGC GAT TAT GTA TCA AAT
Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn
430 435 440

AAA GGG ATG GAC ACT GTG TCT GTA GGT AAC ACA TTA TAT TAT GTA AAT
Lys Gly Met Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn
445 450 455 460

AAG CAA GAA GGT AAA AGT CTC TAT GTA AAA GGT GAA CCA ATA ATA AAT
Lys Gln Glu Gly Lys Ser Leu Tyr Val Lys Gly Glu Pro Ile Ile Asn
465 470 475

TTC TAT GAC CCA TTA GTA TTC CCC TCT GAT GAA TTT GAT GCA TCA ATA
Phe Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile
480 485 490

TCT CAA GTC AAC GAG AAG ATT AAC CAG AGC CTA GCA TTT ATT CGT AAA
Ser Gln Val Asn Glu Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Lys
495 500 505

TCC GAT GAA TTA TTA CAT AAT GTA AAT GCT GGT AAA TCC ACC ACA AAT
Ser Asp Glu Leu Leu His Asn Val Asn Ala Gly Lys Ser Thr Thr Asn
510 515 520

ATC ATG ATA ACT ACT ATA ATT ATA GTG ATT ATA GTA ATA TTG TTA TCA
Ile Met Ile Thr Thr Ile Ile Ile Val Ile Ile Val Ile Leu Leu Ser
525 530 535 540

TTA ATT GCT GTT GGA CTG CTC TTA TAC TGT AAG GCC AGA AGC ACA CCA
Leu Ile Ala Val Gly Leu Leu Leu Tyr Cys Lys Ala Arg Ser Thr Pro
545 550 555

GTC ACA CTA AGC AAA GAT CAA CTG AGT GGT ATA AAT AAT ATT GCA TTT
Val Thr Leu Ser Lys Asp Gln Leu Ser Gly Ile Asn Asn Ile Ala Phe
560 565 570

CATAGACAAC CCATCTGTCA TTGGATTTTC TTAAAATCTG AACTTCATCG AAACTCTCAT 1845
CTATAAACCA TCTCACTTAC ACTATTTAAG TAGATTCCTA GTTTATAGTT ATAT 1899



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Glu Leu Leu Ile Leu Lys Ala Asn Ala Ile Thr Thr Ile Leu Thr
1 5 10 15

Ala Val Thr Phe Cys Phe Ala Ser Gly Gln Asn Ile Thr Glu Glu Phe
20 25 30

Tyr Gln Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr Leu Ser Ala Leu
35 40 45

Arg Thr Gly Trp Tyr Thr Ser Val Ile Thr Ile Glu Leu Ser Asn Ile
50 55 60

Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val Lys Leu Ile Lys
65 70 75 80

Gln Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu Leu Gln Leu Leu
85 90 95



Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu Gly Val
130 135 140

Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys Val Leu His Leu
145 150 155 160

Glu Gly Glu Val Asn Lys Ile Lys Ser Ala Leu Leu Ser Thr Asn Lys
165 170 175



Lys Gln Ser Cys Ser Ile Ser Asn Ile Glu Thr Val Ile Glu Phe Gln
210 215 220

Gln Lys Asn Asn Arg Leu Leu Glu Ile Thr Arg Glu Phe Ser Val Asn
225 230 235 240

Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu Thr Asn Ser Glu
245 250 255



Leu Met Ser Asn Asn Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Ile
275 280 285

Met Ser Ile Ile Lys Glu Glu Val Leu Ala Tyr Val Val Gln Leu Pro
290 295 300

Leu Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro
305 310 315 320

Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg
325 330 335

Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe
340 345 350

Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp
355 360 365

Thr Met Asn Ser Leu Thr Leu Pro Ser Glu Ile Asn Leu Cys Asn Val
370 375 380

Asp Ile Phe Asn Pro Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr
385 390 395 400

Asp Val Ser Ser Ser Val Ile Thr Ser Leu Gly Ala Ile Val Ser Cys
405 410 415

Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile
420 425 430

Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Met Asp
435 440 445

Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Gln Glu Gly
450 455 460

Lys Ser Leu Tyr Val Lys Gly Glu Pro Ile Ile Asn Phe Tyr Asp Pro
465 470 475 480

Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ser Gln Val Asn
485 490 495

Glu Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Lys Ser Asp Glu Leu
500 505 510

Leu His Asn Val Asn Ala Gly Lys Ser Thr Thr Asn Ile Met Ile Thr
515 520 525

Thr Ile Ile Ile Val Ile Ile Val Ile Leu Leu Ser Leu Ile Ala Val
530 535 540

Gly Leu Leu Leu Tyr Cys Lys Ala Arg Ser Thr Pro Val Thr Leu Ser
545 550 555 560

Lys Asp Gln Leu Ser Gly Ile Asn Asn Ile Ala Phe Ser Asn
565 570

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15